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© DNA polymerases having modified nucleotide binding site for DNA sequencing. 

© Modified gene encoding a modified DNA polymerase wherein the modified polymerase incorporates 
dideoxynucleotides at least 20-fold better compared to the corresponding deoxynucleotides as compared with 
the corresponding naturally-occurring DNA polymerase. 
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This application is a continuation-in-part of Tabor and Richardson, entitled "DNA POLYMERASES 
HAVING MODIFIED NUCLEOTIDE BINDING SITE FOR DNA SEQUENCING, n filed October 17, 1994, the 
whole of which (including drawings) is hereby incorporated by reference herein. 

5 Background of the Invention 

This inv ntion was made with government support including a grant from the U.S. Dept of Energy, 
contract number DE-FG02-88ER60688. The U.S. government may have certain rights in the invention. 

This invention relates to DNA polymerases suitable for DNA sequencing and to automated and manual 
to methods for DNA sequencing. 

The following is a brief description of art relevant to DNA sequencing techniques. This is provided only 
to give general guidance to those reading the application, and is not an admission that any art cited herein 
or referred to explicitly or implicitly is prior art to the appended claims. 

DNA sequencing generally involves the generation of four populations of single-stranded DNA frag- 
rs ments having one defined iwrminus and one variable terminus. The variable terminus generally terminates 
at specific nucleotide bases (either guanine (G). adenine (A), thymine (T), or cytosine (C)). The four different 
sets of fragments are each separated on the basis of their length, one procedure being on a high resolution 
polyacrylamide gel; each band on the gel corresponds coiinearly to a specific nucleotide in the DNA 
sequence, thus identifying the positions in the sequence of the given nucleotide base. See Tabor and 
20 Richardson, U.S. Patents 4,942,130 and 4,962,02 r 

There are two general methods of DNA sequencing. One method (Maxam and Gilbert sequencing) 
involves the chemical degradation of isolated DNA fragments, each labeled with a single radiolabel at its 
defined terminus, each reaction yielding a limited cleavage specifically at one or more of the four bases (G. 
A, T or C). The other method (dideoxy or chain-termination sequencing) involves the enzymatic synthesis of 
25 a DNA strand. Sanger et a/. (Proa Nat. Acad. ScL USA 74:5463, 1977). Four separate syntheses are 
generally run, each reaction being caused to terminate at a specific base (G, A, T or C) via incorporation of 
an appropriate chain terminating nucleotide, such as a dideoxynucleotide. The latter method is preferred 
since the DNA fragments can be uniformly labelled (instead of end labelled) by the inclusion of a 
radioactively labeled nucleoside triphosphate and thus the larger DNA fragments contain increasingly more 
30 radioactivity. Further, ^S-labelled nucleotides can be used in place of 32 P-labelled nucleotides, resulting in 
sharper definition; the reaction products are easier to interpret since each lane corresponds only to either G, 
A, T or C. The enzymes used for most dideoxy sequencing include T7 DNA polymerase and DNA 
polymerases isolated from thermophilic organisms such as Taq, Vent. Tth. and others. Other polymerases 
used to a lesser extent include AMV reverse transcriptase and Klenow fragment of £ coli DNA polymerase 

35 I. 

in the dideoxy chain terminating method a short single-stranded primer is annealed to a single-stranded 
template. The primer is elongated at its 3'-end by the incorporation of deoxynucleotides (dNMPs) until a 
dideoxynucleotide (ddNMP) is incorporated. When a ddNMP is incorporated elongation ceases at that base. 
Other chain terminating agents can be used in place of a ^dNTP and the ddNTP can be labellea as 
40 discussed below. 

Using the above methodology, automated systems for DNA sequence analysis have been developed. 
One instrument, which was manufactured by EG&G, makes use of conventional dideoxy chain terminating 
reactions with a radioactively labeled nucleotide. The resulting DNA products are separated by gel 
electrophoresis. Toneguzzo et af , 6 Biotechniques 460. 1£:3. A detector scans for radioactivity as it 

45 passes through the bottom of the gel. Four synthesis reactions are required for each template to be 
sequenced, as well as four lanes on each gel. a separate lane being used for products terminated by each 
specific chain terminating agent. 

Kambara et al , 6 Biotechnology 816. 1988. have used a fluorescent-labelled primer. The resulting 
fluorescently labelled products are excited with a laser at the bottom of the gel and the fluorescence 

so detected with a CRT monitor. This procedure also requires four synthesis reactions and four lanes on the 
gel for each template to be sequenced. 

Applied Biosystems manufactures an instrument in which four different primers are used, each labelled 
with a different fluorescent marker. Smith et aA. 13 Nuc. Acid. Res. 2399, 1985; and 321 Nature 674, 1986. 
Each primer is used in a separate reaction containing one of four dideoxynucleotides. After the four 

55 reactions have been carried out the mixtures are combined and the DNA fragments are fractionated in a 
single lane on a gel. A laser at the bottom of the gel is used to detect fluorescent products after they have 
been electrophoresed through the gel. This system requires four separate annealing reactions and four 
separate synthesis reactions for each template, but only a single lane on the gel. Computer analysis of the 
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of a normal analog. The polymeras s of this inv ntion also d not discriminate significantly against some 
other analogs. This is important since, in addition to the four normal deoxynucleoside triphosphates (dGTP, 
dATP, dTTP and dCTP), sequencing reactions require the incorporation of other types of nucleotide 
denvatives such as: radioactively- or fluor scently-labelled nucleoside triphosphates, usually for labeling the 
synthesiz d strands with ^S, ^P, or other chemical agents. When a DNA polymerase does not discriminate 
against analogs the same probability will exist for the incorporation of an analog as for a normal nucleotide. 
For labelled nucleoside triphosphates this is important in order to efficiently label the synthesized DNA 
strands using a minimum of radioactivity. [4,942,130, COL 5:5] 
They also state: 

The ability to produce nearby bands of approximately the same intensity is useful since it permits the 
results of any sequencing reaction to be read more easily and with greater certainty. Further, since the DNA 
products from a sequencing reaction with a specific chain terminating agent form bands which are of 
approximately the same intensity as that of nearby bands, band intensity itself provides a specific label for 
the series of bands so formed. The number of DNA products of approximately the same molecular weight 
produced by a given chain terminating agent varies depending upon the concentration of the chain 
terminating agent. Thus, by using a different concentration of each of the four chain terminating agents for 
the syntnesis the DNA products incorporating one chain terminating agent are distinguished from DNA 
products of approximately the same molecular weight incorporating other chain terminating agents in that 
they differ in number or amount; consequently, the bands of DNA products can be identified as to chain 
terminating agent simply by their intensity as compared to the intensities of nearby bands. As a result, two 
or more series of DNA products, each series having a different chain terminating agent, can be subjected to 
gel permeation in a single lane and identified, i.e.. distinguished from each other, by the intensity of each 
band as compared to the intensity of nearby bands. Moreover, the syntheses of DNA products incorporating 
different chain terminating agents need not be carried out separately, in separate containers, but may all be 
carried out simultaneously in a single reaction vessel; and the same label, e.g., radioisotopic, fluorescent, 
etc. can. if desired, be used for all chain terminating agents instead of a different label for each, thus 
simplifying the procedure. [4,962,020, col 3:1-35] 

See also Tabor and Richardson Proc. Nat!. Acad. ScL USA 86, 4076 -4080 (1989) which indicates that 
substitution of manganese ions for magnesium ions for catalysis by 17 DNA polymerase or E. coll DNA 
polymerase reduces the discrimination of these polymerases for ddNTPs by 4-100 fold, and Tabor and 
Richardson J. Biol. Chem. 265, 8322-8328 (1990) which describes the use of pyrophosphatase and 
manganese ions to generate dideoxy-terminated fragments of uniform intensity using T7 DNA polymerase. 

Summary of the Invention 



Applicant believes that the lesser utility of some DNA polymerases for dideoxy DNA sequencing is due 
in part to the reduced ability of those polymerases to incorporate a ddNMP (or other nucleotide analog) in 
place of a dNMP. As noted above, the ability not to discriminate allows use of lower concentrations of 
ddNTPs than with enzymes that do discriminate, and most important it providas banding patterns in 
sequencing gels that are of more uniform intensity along their length. Both these results make automated 
sequencing with the enzyme easier and more profitable in that longer DNA sequences can be determined 
with greater confidence. 

The present invention provides a method by which an otherwise discriminatory DNA polymerase can be 
altered to discriminate less against ddNTPs than the corresponding naturally occurring enzyme. This 
method involves the modification of the polymerase genetically to provide amino acid residues at key 
locations that enhance the ability of the polymerase to incorporate analogs of dNMPs. Applicant has 
determined that amino acid changes in a specific region of DNA polymerases have a dramatic effect on the 
ability of those DNA polymerases to incorporate dideoxynucleotides; the specific amino acid residue 
inserted determines whether the polymerase is more or less discriminatory towards ddNTPs. Applicant has 
determined that modifying DNA polymerases so that they incorporate dideoxynucleotides more efficiently 
has a tremendous effect on their utility in DNA sequencing. It is possible that such modified DNA 
polymerases will also prove useful for other common molecular biology procedures such as the amplifica- 
tion of DNA (for example by the polymerase chain reaction), in vitro mutagenesis and filling in the ends of 
DNA fragments. The combination of this technology with existing knowledge for alteration of 3'-5* 
exonuclease activity of a DNA polymerase (such as T7 DNA polymerase as described by Tabor and 
Richardson, supra), and the use of manganese and pyrophosphatase in sequencing reactions will allow the 
production of significantly superior enzymes to those presently known. 
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In on particular aspect, namely, DNA sequencing, thermophilic enzymes having an ability to catalvze 
the polym rizati n of nucleotid s at t mperatures above 50 -C and. in particular, above 60 * C 70 «C or 
even 80 -C, under conditions used for ONA sequencing, are well known in the art. Such nzymes 'are 
g nerally present in organisms that grow at these temperatures. However, applicant believes that many of 

s th se i enzym s suffer from limitations, including the limited ability to incorporate dideoxynucleotides By 
modification of these enzymes using methods shown below, those in the art can now modify any desired 
thermophilic DNA polymerase to make it incorporate dideoxynucleotides more efficiently. Such enzymes 
w,ll be supenor to those existing in the present day for DNA sequencing both in automated machines and in 
manual sequencing, especially in procedures known as cycle sequencing. In cycle sequencing multiple 

io rounds of DNA synthesis are carried out from the same template, with the synthesized strand removed after 
each cycle by heat denaturatjon; this allows much smaller amounts of DNA template to be used in a 
sequencing reaction 

Applicant has determined experimentally that amino acid residue 526 in the relatively non-dis- 
cnm.nat.ng enzyme T7 DNA polymerase provides it with this property. Applicant has determined that by 
is modification of residue 526 it is possible to increase the ability of T7 DNA polymerase to discriminate many 
fold Based on the amino acid homologies between T7 DNA polymerase and other DNA polymerases 
applicant has determined that altering the residue at the homologous site in other DNA polymerases 
likewise affects their ability to discriminate against dideoxnucleotides. Examples of such homologous sites 
are residue 762 of f. coli DNA polymerase I and residue 667 of Taq DNA polymerase. In all three of these 
20 examples, applicant has shown that it is important that the residue at this site is different from phenylalanine 
(F) e.g .. it may be tyrosine (Y) as in T7 DNA polymerase. Surprisingly, modification of this single amino 
acid residue, even by the addition of a single hydroxyl group, provides a very large alteration (250 - 8 000 
fold) in d.scrimination levels. Those in the art will recognize that changes at this one site are not limiting in 
th.s invention, and changes at other sites that decrease the ability of a polymerase to discriminate can now 
25 be readily found by routine experimentation. For example, applicant has found that modification of T7 DNA 
polymerase at 13 other sites also results in an increased ability of the enzyme to discriminate against 
ddNTPs. however the effect of alterations at these sites is much less, only 5 - 20 fold. By use of analogous 
procedures, other sites that effect discrimination against ddNTPs can be readily identified in other DNA 
polymerase to make them more useful for DNA sequencing. Such other sites include amino acid residues in 
30 reg.ons highly homologous between E. coli DNA polymerase I and T7 DNA polymerase, since these 
regions are likely to make up in part the binding domain for ddNTPs; in E. coli DNA polymerase I these 
regions include amino acids in regions analogous to those in T7 DNA polymerase from conserved or non- 
conserved amino acids in regions 665 - 681 and 754 - 783. and possibly in regions 709 - 734. 797 - 866 
and 913 - 927. The amino acid change to provide the desired function can be chosen to be identical to the 
as corresponding amino acids of a non-discriminatory enzyme like T7 DNA polymerase, or other functionally 
equivalent amino acids which can be chosen by routine experimentation. By changing non-conserved amino 
acids more profound alteration of the ability to discrimination is obtained. Non-conserved amino acids are 
those which vary from one species of polymerase to another (;.e.. are found in less than 50% of 
polymerases). The term "analogous- is used in its commonly recognized manner. Thus, an analogue of a 
40 Pol I polymerase is one having an amino acid sequence as described by Braithwaite and Ito, infra which is 
preferably as related to the other members of the Pol I family of polymerases described therein as is Spo2 
DNA polymerase. Such analyses can be performed using Felsenstein's PHYLIP program. Id. 

Thus, in a first aspect the invention features a modified gene encoding a modified DNA polymerase 
The gene is modified to produce a modified DNA polymerase that, compared to the corresponding 
45 naturally-occurring or unmodified DNA polymerase, has an increased ability to incorporate a dideox- 
ynucleotide compared to a deoxynucleotide. 

By "increased ability" is meant that the DNA polymerase is able to better incorporate a dideox- 
ynucleotide. That is. it discriminates to a lesser extent than a corresponding naturally-occurring DNA 
polymerase against a dideoxynucleotide compared to a deoxynucleotide. Specific methods for measuring 
so such discrimination are provided below. The term "increased" means to provide a measurable difference in 
ability to incorporate such dideoxynucleotides. In preferred embodiments, this is a increase of at least 10% 
compared to the naturally-occurring enzyme, although it is preferred that the level of discrimination against 
a dideoxynucleotide is reduced by at least 10 to 100 fold and preferably by 100 - 500 fold. One example of 
such an enzyme is E. coli DNA polymerase I which (as noted herein) discriminates approximately 140 - 
55 1 ,1 00 fold against incorporation of dideoxynucleotides compared to deoxynucleotides. By the method of this 
invention an enzyme can be derived (by alteration of only one or two amino acids) that actually prefers 
ddNTPs over dNTPS - that is, the ability of the polymerase to incorporate dideoxynucleotides has been 
increased by an average of 1 ,000 fold. 
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Th phrase "corresponding naturally-occurring ONA polymeras " is one well known in the art and 
ref rs to the polym ras found in nature, and which preferably is not alt red ither by in vitro or in vivo 
manipulations in th laboratory. Similarly the corresponding nucl ic acid is that nucleic acid encoding a 
DNA polymeras found in natur . This is simply used as a base line to compar modified nucleic acids 
s ncoding such polym rases. Thus, a base line for the DNA polym ras of Thermus aquaticus (also termed 
m Jaq n ) is the nucleic acid which naturally encodes Taq DNA polymerase present in the bacterium Thermus 
aquaticus. Applicant provides at least one site which can be changed in such polymerases to alter th 
ability of the polymerase to incorporate a dideoxynucleotide. These sites are merely examples and are not 
limiting in this invention since those in the art armed with the knowledge that the ability of a DNA 
io polymerase can be usefully altered in this property are now provided with the methodology by which to 
alter such enzymes either at these specific sites or at other equivalent sites. 

In embodiments of the invention described herein, modification of the DNA polymerase is by substitu- 
tion of one or more amino adds. It may however be found that the modification may take the form of the 
insertion of one or more additional amino acids or the deletion of one or more amino acids. 
15 DNA polymerases of this invention may also be modified to remove or alter an exonuclease domain, 
such as the 3* - 5' exonuclease activity described by Tabor and Richardson, supra, or the 5' - 3' 
exonuclease activity in Taq described by Barnes (WO 92/06188). The mutations that alter the ability of DNA 
polymerases of this invention to discriminate against ddNTPs preferably do not affect the exonuclease 
activity substantially; by this it is meant that the mutations are in the polymerase domain of the enzyme, 
20 near the active site for polymerization, and are not decreasing discrimination merely by reducing the ability 
of the polymerase to remove incorporated analogs via its exonuclease activity. Particularly suitable DNA 
polymerases of this invention are Pol I - type polymerases as described by Braithwaite and Ito, 21 Nuc. 
Acid, Res. 787, 1993, hereby incorporated by reference herein, and referred to as Family A, and 
polymerase alpha or polymerase II - type DNA polymerases described by Braithwaite and Ito. and referred 
25 to as family B. The other polymerase families described by Braithwaite and Ito may also be used in this 
invention. In particular, applicant has found that the presence of a polar, hydroxyl containing amino acid 
residue at a position near the binding site for the dNTP substrate is important for the polymerase being able 
to efficiently incorporate a dideoxynucleotide. Without being bound by any theory, Applicant believes that 
this finding is contrary to the expected result that high discrimination against a nucleotide without a hydroxyl 
30 group at the 3' position of the ribose moiety (/.e. a ddNTP) requires the simultaneous absence of a hydroxyl 
group on the amino acid residue at this critical site, tn other words, the presence of the gap, or hole created 
by the absence of both hydroxyl groups leads to discrimination against the analog. Knowledge of this result 
provides an approach to finding the critical residue in even distantly related DNA polymerases; the addition 
of a residue with a polar group for a non polar one in the region where the dNTP binds is a useful candidate 
35 amino acid change for decreasing the ability of the polymerase to discriminate against ddNTPs. For 
example, the phenylalanine at position 272 of rat DNA polymerase b, a DNA polymerase with little if any 
homology to polymerases of Family A or B, has been shown by X-ray diffraction studies to be in contact 
with the 3* position of the ddCTP residue in a ternary complex with a primer-template (Pelletier et a/., 264 
Science 189, 1994). Knowledge of the results described in this invention make the modification of this 
40 residue to tyrosine a logical choice in a screen of mutants of rat DNA polymerase b that incorporate 
dideoxynucieotides more efficiently. Those in the art will thus be likely to be able to alter the discriminatory 
phenotype of any DNA polymerase using the information provided herein. 

The ability of some polymerases of this invention to incorporate dideoxynucieotides more efficiently can 
be specific (/.e., the effect on dideoxynucleotide analogs is much greater than on other analogs). However, 
45 some of the polymerases are also useful to aid in incorporation of other base modified analogs (e.g., 
deoxyinosine triphosphate (dITP) and 2'-deoxy-7-dea2aguanosine 5Mriphosphate (dc 7 GTP) to remove 
compression of bands during electrophoresis, and fluorescently labeled deoxynucleotides or dideox- 
ynucieotides for use in automated procedures). In addition, such polymerases may be able to incorporate 
ribonucleotides more efficiently thus allowing synthesis of RNA without need for a promoter. Specifically, 
so the conserved motifs between single-subunit DNA-dependent RNA polymerases such as T7 RNA poly- 
merase and DNA polymerases of Family A (Pol I - type DNA polymerase) suggest that mutations in this 
region (residues 758 to 767 of E. coii DNA polymerase I) are likely to change the specificity towards rNTPs. 
This permits the engineering of RNA polymerases that efficiently initiated synthesis from a primer, 
eliminating the requirement for a promoter for the synthesis of RNA. Analogously, the data provided herein 
55 suggest that modifying residues 631 to 640 of T7 RNA polymerase will alter its specificity towards dNTPs. 
This permits the engineering of a new DNA polymerase that initiates DNA synthesis de novo from a 
promoter sequence, and cannot use a primer. 
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a t i«L »t a , J ment8, * e m0dified ° NA P° , y merase has suffi <=ient DNA polymerase activity tea 
at least that used in a standard sequencing r action, and preferably at I ast 100 units/mo of enJm «' 
denned in th art; or ferably the mutation in the polymerase does not alter the prior S^^SS^i 
fold) for use .n DNA sequencing (when combined with any host factor necessary for that DNA ^mer^ 
s activity); and has suffici ntly ,ow exonucl ase activity (e.g.. less than ScTuSI se *Z ™ 
R-chardson. supra) to allow the polymerase to be used in DNA sequencing; the DNA poUeraseTas o^e 
1«Z ° 6 r in0 3CidS * dideox V" uc '^a binding sit of a T7-type DNA^merase ^ ff! Z 

mS2 ST ? 9r ° UP COnS, ' Stin9 °' 77 T3 " 0 '- 0 " H " W31 ' O"" 1 - Y " ^d SP6). Preferable 

modified DNA polymerase ,s modified from a tfiermostable enzyme, such as the DNA polymerase encoded 

t y J hermU t T at ' CU5 ' Thermu$ """"OP"""*' ">*«»>" "a™. Bacillus sterothemoZluT aZve^t 
XEV - : ^ T* t0 inCOfporate 8 <**°Wleotide is incre^^ Sst 10-^3 

50-fold or most preferably at least 100-fold compared to the corresponding naturally^rina DNA 
polymerase, e.g., by a change at just one amino acid. aiurauy-occurnng dna 

In a second aspect, the invention features a method for production of a modified DNA polymerase 
>s hav.no an mcreased ability to incorporate a dideoxynucleotide compared to the ability of a co^ ZZ 

sxzzzr 0 t po,y ~ The method inc,udes providin9 a nuci * c ■* 

mnl^T muta 9 en,a "9 or otherwise altering the nucleotide base sequence of the nucleic acid 

molecule to incorporate one or more base changes in the nucleotide base sequence at one or more sites 

20 ZtXT V ?*' 3t •f St 10 ' 50 ° r m0St Pr8,erab,y ,0M0 ° ,0,d) *"*» abili * of the pTyme 2 
20 encoded by the nucleic acid to incorporate a dideoxynucleotide. 

nM ' n a 1 tWn ! 106 inventl 'o n fea t" r es a method for determining the nucleotide base sequence of a 

hvlrid^rh^MA" 19 ^ , inC ' Ud ! S Pr ° Vidinfl 8 ° NA molecu,e annea,ed with a P' im *' ^lecule able to 
££ir , J? ? t m ? CU ' e: inCUb8tin9 1,10 8nnea,ed mo,ecules in a vessel containing at least one 
deoxynucleot.de tnphosphate. a DNA polymerase modified from a naturally-occurring DNA polymerase to 
2S have an increased ability to incorporate a dideoxynucleotide compared to the naturally-occurring poly- 
merase (The Phrase has sufficient DNA polymerase activity and sufficiently low exonuclease activity to 
SrJnS" '°' ° N \^ uencin 9 ) Prided is at least one DNA synthesis terminating agent which 
terminates DNA synthesis at a specific nucleotide base. The method further includes separating the DNA 

» ZT^L°l,^ln^ bati y ? aCti0n aCC ° rdin9 t0 Si26 ' whereby 81 ,east 8 Dart °' th ° ""oleotide base 
30 sequence of the ONA molecule can be determined. 

In preferred embodiments, the ONA polymerase is a thermostable DNA polymerase and the sequencing 

1 P J cnT^ at a t6mper8tUre 8b0ve 50, °' 60 ' C ' or 70 * C ' ^ DNA polymerase is derived (Te.. has at 
least 50/- identity in amino acid residues) from one encoded by Thermus aquaticus. Thermus ther- 
mophilus. Thermus flavus. Bacillus sterothermophllus. Thermococcus litoralis (Vent) Pyococcus 

35 funosus (Pfu) or Sulfolobus solfatarlcus. ryrococcus 
In other preferred embodiments, the DNA polymerase has less than 1000. 250. 100 50 10 or even 2 
units of exonuclease activity per mg of polymerase and is able to utilize primers having only 4 6 or 10 
bases; and the concentration of all four deoxynucleoside triphosphates at the start of the incubating step is 
sufficient to allow DNA synthesis to continue until terminated by the agent, e.g.. a ddNTP 

<o For cycle sequencing, the polymerases of the present invention now make it possible to use 
significantly lower amounts of dideoxynucleotides compared to other enzymes. That is. the method involves 
providing an excess amount of deoxynucleotides to all four dideoxynucleotides in a cycle sequencing 
reaction, and performing the cycle sequencing reaction. For other enzymes, it was necassary to add an 
excess of at least one of the ddNTPs to such reactions. For example, Sears ef a/.. 13 BioTechniques 626 

45 1 992 describe use of about a 1 0 fold excess of ddNTPs to dNTPs with Vent polymerase, and Carothers ef 
a/.. 7 BioTechniques 494. 1989 describe use of at least 2 fold excess of ddNTPs to dNTPs for Taq 
polymerase. In the present invention, such excess is not needed. Preferably, more than 2.5, or even 10 fold 
excess of a dNTP is provided to the corresponding ddNTP. In a specific example, less than 10 uM ddNTP 
is used with a modified Taq of this invention. 

50 nK,J n 3 re ' ated 8SpeCt ' the invention ,eatures a *''< °f solution for DNA sequencing including a modified 
DNA polymerase as described above and a reagent necessary for the sequencing selected from the group 
cons.st.ng of dITP. deaza GTP. a chain terminating agent such as a ddNTP. and a manganese-containinq 
solution or powder. a 

In another aspect, the invention features a method for providing a modified DNA polymerase having an 
55 increased ability compared to the corresponding naturally-occurring DNA polymerase to incorporate a 
dideoxynucleotide by providing a nucleic acid sequence encoding the modified DNA polymerase express- 
ing the nucleic acid within a host cell, and purifying the DNA polymerase from the host cell 
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ae JLTSS 1, ? 8SpeCt ' *" invention features a m«hod for sequencing a strand of ONA ssentialiv 
as described above with one or more (preferably 2. 3 or 4) deoxynbonucleoside triphosphates TTotl 
polymerase as described above, and a first chain terminating agent. The DNA pofymerS Sus %T 
pnmer to be elongated to form a first series of first DNA products differing in the length of th etonoated 

i primer each first DNA product having a chain terminating agent at its elongated end anJ tie nuZr f 
molecules o each first DNA products being approximat ly th same for substantially all DNA products 
differing ,n length by no more than 20 bases. The method a. S0 features providing a secondlha* 
terminating agent in the hybridized mixture at a concentration different from the first chain terminal 
agent, wherem the DNA polymerase causes production of a second series of second DNA 3u<S 

> differing ,n the length of the elongated primer, with each second DNA product having the second chain 
terminating agent at its elongated end. The number of molecules of each second DNA product is 

ZZH^ySS" ^ f ?' * ut ? stantial, y 3,1 second DNA P^ucts differing in length from each other by 
from 1 to 20 bases, and is distinctly different from the number of molecules of all the first ONA oroducte 
having a length differing by no more than 20 bases from that of said second DNA products 

nr J?JZ en ^? ime x t !• ° f f0Ur SUCh Chain terminatin 9 a 9ents can be used to make different 
products « described ,n Tabor and Richardson, supra; and the sequencing reaction is provided w.Ta 
magnesium ,on. or even a manganese or iron ion (e.g., at a concentration between 0.05 and 100 mM 

W^tS of a a n ge, 1 ° ^ *" " * " ^ «"*» * * 

In another related aspect, the invention features a method for sequencing a nucleic acid by combinina 
an oligonucleotide pnmer. a nucleic acid to be sequenced, between one and four deoxynbonucleoside 
triphosphates, a polymerase as described above, and at least two chain terminating agents in different 
amounts, under conditions favoring extension of the oligonucleotide primer to form nucleic acid fraoments 
complementary to the nucleic acid to be sequenced. The method further includes separating the nucleic 
acid fragments by size and determining the nucleic acid sequence. The agents are differentiated from each 
other by intensity of a label in the primer extension products. 

While it is common to use gel electrophoresis to separate DNA products of a DNA sequencing reaction 
those in the art will recognize that other methods may also be used. Thus, it is possible to detect each of 
the different fragments using procedures such as time of flight mass spectrometry, electron microscopy 
and single molecule detection methods. 

The invention also features an automated DNA sequencing apparatus having a reactor includina 
reagents wh.ch provide at least two series of DNA products formed from a single primer and a DNA strand 
Each DNA product of a series differs in molecular weight and has a chain terminating agent at one end The 
reagents include a DNA polymerase as described above. The apparatus includes a separating means for 
separating the DNA product along one axis of the separator to form a series of bands. It also includes a 
band reading means for determining the position and intensity of each band after separation along the axis 
and a computing means that determines the DNA sequence of the DNA strand solely from the position and 
intensity of the bands along the axis and not from the wavelength of emission of light from any label that 
may be present in the separating means. 

In other aspects, the invention features: (a) a method fc in vitro mutagenesis of a cloned DNA 
fragment by providing the cloned fragment and a DNA polymerase described above, contacting the cloned 
fragment with the polymerase under conditions for synthesizing a DNA strand from the fragment The 
conditions cause formation of the DNA strand by incorporation of a plurality of individual contiguous bases 
able to base-pair with the fragment and incorporation of a nucleotide base unable to base pair with the 
fragment; (b) a method for in vitro mutagenesis of a template DNA fragment by providing a primer and 
template, the pnmer having contiguous bases able to base-pair with contiguous bases of the template 
except at least one base which is unable to base-pair with the template. The method involves extending the 
primer with a DNA polymerase as described above; (c) a method for producing blunt-ended double- 
stranded DNA from a linear DNA molecule having a 5* end having a single-stranded region. The 3' end of 
the molecule is double-stranded and has no 3* protruding termini. The method includes incubating the DNA 
molecule with a DNA polymerase as described above which acts on the single-stranded region to produce a 
blunt-ended double-stranded DNA molecule; (d) a method for labeling the 3' end of a DNA fragment by 
incubating the DNA fragment with a DNA polymerase as described above, and a labelled deoxynucleotide 
spec.es under conditions in which the 3' end of the DNA fragment is extended by the polymerase and 
thereby labelled by addition of the labelled deoxynucleotide to the DNA fragment- (e) a method of 
amplification of a DNA sequence by annealing a first and second primer to opposite strands of a double- 
stranded DNA sequence and incubating the annealed mixture with a DNA polymerase as described above 
The first and second primers anneal to opposite strands of the DNA sequence with their 3' ends directed 
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towards each other after annealing, and with the DNA sequ nc to be amplified located between the two 
annealed primers. 

In yet other aspects, the invention features specific DNA polymerases, such asThermus aquatlcus 
DNA polymerase having a tyrosine at residue 667, E. coli DNA polymerase I having a tyrosin at residue 
762. and any Pol I type DNA polymerases having a tyrosine residue at the analogous location to E. coll 
DNA polymeras residue 762, e.g.. at the N* position of the amino acid sequence K Ni isfe Na N* Ns Nc H? 
Y G, wherein each N is independently any amino acid. Furthermore, this invention features specific 
polymerases of the DNA polymerase alpha family having the sequence K Ni N 2 N 3 N* Ns Ng Y G/Q. 
wherein each N is independently any amino acid, and where one of the residues Ni to N7 has been 
mutated to produce a polymerase that has reduced discrimination against ddNTPs (preferably reduced by 
at least 20-fold compared to the non-mutated sequence). The invention also features nucleic acid encoding 
any of these DNA polymerases. 

In related aspects, the invention features DNA polymerases except reverse transcriptase which in the 
presence of magnesium as the only added divalent cation have an average processivity of less than 100 
and discriminate less than 100 times against incorporation of a ddNMP compared to a dNMP, or which in 
the presence of magnesium as the only added divalent cation have an average processivity of less than 50 
and discriminate less than 50 or 5 times against incorporation of a ddNMP compared to a dNMP. Those in 
the art will recognize that processivity can be measured by any standard procedure that will indicate that 
the average processivity of T7 DNA polymerase is at least 500, that of Klenow fragment is about 4 - 40, and 
for reverse transcriptase it is about 150 - 200. Such measurements can be performed as described by 
Tabor ef a/., J. Biol. Ohem. 262: 16212, 1987, hereby incorporated by reference herein. The average 
processivity of Taq DNA polymerase under these conditions is expected to be less than 100. 

In particularly preferred aspects, the invention features thermophilic DNA polymerases that discriminate, 
e.g., in the presence of magnesium, against a ddNMP compared to a dNMP by less than a factor of 100. 
and which preferably have an average processivity less than 100. and cycle from one primer-template to 
another more than once per one or even ten seconds. Such cycling can be measured by standard 
procedures. 

The invention also features a method for cycle sequencing using a DNA polymerase as described 
above, and also features cellular (as opposed to viral or mitochondrial) DNA polymerases having a tyrosine 
in place of the naturally occurring amino acid at a location which causes the polymerase not to discriminate 
against a ddNMP compared to a dNMP by more than 50 fold. 

In other aspects, substitution of the amino acid at the noted sites will result in alteration of other 
properties of the corresponding natural polymerase. In addition, polymerases of this invention may be 
combined with other polymerases in the methods described herein to take advantage of the superior 
properties of each polymerase in the mixture. 

Other features and advantages of the invention will be apparent from the following description of the 
preferred embodiments thereof, and from the claims. 

Description of the Preferred Embodiments 

The drawings will first briefly be described. 

Drawings 

FIG. 1 is a diagrammatic representation of the amino acid sequence of the DNA polymerase encoded by 
gene 5 of bacteriophage T7 indicating the palm and finger domains, and the location of various dideoxy 
resistant (DR) mutants, the location of regions labelled A-E, and the location of one site involved in 
ddNTP discrimination; 

FIG. 2 is a three dimensional representation of the structure of DNA polymerase I showing the locations 
of regions A-E; 

FIG. 3 is a diagrammatic representation of a riboselectivity region of pol l-type DNA polymerases with 
amino acids indicated by the universal single letter code. The initial amino acid number is indicated on 
the left of the figure and the amount of discrimination against dideoxynucleotides compared to 
deoxynucleotides is indicated on the right; 

FIGS. 4, 5, and 6 are diagrammatic representations showing modifications of the riboselectivity region of 
E. coli DNA polymerase I, T7 DNA polymerase, and Taq DNA polymerase, respectively. 
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Dideoxy Resistant Mutants 

The following is a brief discussion of publications of some relevahc . None are admitted to be pri r art 
to the pending claims but are provided to aid in understanding the present invention. 

5 Reha-Krantz ef a/. # Mutational Analysis of Bacteriophage T4 DNA Polymerase, from Abstracts for Post r 
Presentations, presented at a meeting entitled "The Fidelity of DNA Synthesis: Structural and Mechanistic 
Perspectives, Beaufort. North Carolina, September 24-29, 1989 describe C-terminal mutants with increased 
utilization of ddNTPs. However, Reha-Krantz ef ai., J. Virology 67. 60-66 (1993), indicate that while the Ki 
for ddGTP was 50 times lower with the mutant L412M compared to wild-type T4 DNA polymerase, no 

w difference in efficiency of incorporation of ddGTP was found between the mutant and wild-type T4 DNA 
polymerase. On page 63, it states that "Despite the sensitivity of the L412M DNA polymerase to ddGTP, 
there was no difference found in the incorporation of ddNTPs by wild-type and L412M DNA polymerases". 
It also states "it does not appear that any single region is the sole binding site for either PPi or 
nucleotides". In addition, Reha-Krantz and Nonay, J. Biol. Chem. 269, 5635-5643 (1994), provide a study of 

rs the mutant L412M and other mutant T4 DNA polymerases. 

Gibbs et ai, Proc. Natl. Acad. $cl. USA 85, 6672-6676 (1988) and Larder ef a/., the EMBO Journal 6, 
169-175 (1987) describe the spectrum of mutations obtained in Herpes DNA polymerase when selected for 
resistance to a number of nucleotide analogs: pyrophosphate, phosphonoacetic acid, and phosphonoformic 
acid, acyclovir, vidarabine, ganciclovir and bromovinyldeoxyuridine. It indicates that many of the mutants 
20 resistant to one drug are also resistant to other drugs, even when they are analogs to different regions of 
the substrate. 

Derse ef a/., J. Biol. Chem. 257. 10251-10260 (1982), describe five classes of mutants in Herpes 
Simplex DNA polymerase isolated by selection for growth in the presence of phosphonoformic acid, a 
pyrophosphate inhibitor. For mutants in each class, they compare resistance to ddGTP (page 10256, Table 

25 III). All increase the Ki for ddGTP by 20 to 100 fold. 

Prasad et a/., Proc. Natl. Acad. Scl. USA 88, 11363-11367 (1991) use a direct screening strategy and 
show that a single mutation in HIV reverse transcriptase (a change of Glu 89 to Glycine) renders the 
polymerase more resistant to ddGTP (requires about 10 times more ddGTP to obtain same extent of 
inhibition). This mutation confers a broad resistance to a number of analogs, including phosphonoformic 

30 acid, a pyrophosphate analog. While the mutant was equally resistant to ddTTP, ddCTP and ddGTP, it was 
much less resistant to ddATP. 

Song ef a/., J. Virol. 66, 7568-7571 (1992) mutate glu-89 of human immunodeficiency virus type 1 
reverse transcriptase to 9 different amino acid residues, and measure the resistance of each mutant 
enzyme to ddGTP and phosphonoformic acid, a pyrophosphate analog. The mutations fell into two classes; 

35 replacement of Glu-89 with alanine, glycine, valine or threonine resulted in enzymes highly resistant to both 
ddGTP and phosphonoformic acid compared to the wild-type enzyme, while mutation to serine, glutamine, 
asparagine, aspartic acid, and lysine resulted in enzymes with only moderate or no resistance to ddGTP. 
None of the mutants made the enzyme less resistant to ddGTP than the wild-type enzyme (Table I, page 
7569). The authors speculate that the 89th an* 90th residues of reverse transcriptase form a portion of the 

40 dNTP-binding pocket based on their results and the crystal structure of reverse transcriptase. 

Papers concerned with the properties of E. coli DNA polymerase I mutant proteins with mutations in the 
vicinity of the mutation that result in ddNTP selectivity including the following: 

Carroll ef ai. Biochemistry 30, 804-813 (1991) study two mutants: Tyr766Ser and Tyr766Phe for 
misincorporation of normal deo> , nucleotides. Polesky ef a/., J. Biol.Chem. 265, 14579-14591 (1990), 

45 characterizes mutations that have two different properties: (1) Tyrosine 766, arginine 841, and asparagine 
845; which the authors suggest that these residues contact the incoming dNTP. (2) Glutamine 849, arginine 
668, and aspartic acid 882, which the authors suggest are involved in catalysis. Polesky ef ai, J. Biol. 
Chem. 267:8417 (1992), further characterizes mutations in arginine 668, glutamine 849, and aspartic acid 
882, and also mutations at aspartic acid 705 and glutamic acid 710. In this study the authors look at the 

so incorporation of alpha-thio-substituted dNTPs, i.e. analogs in the phosphate moiety. Pandey ef ai, J. 
Biol.Chem. 269. 13259-13265 (1994) look at two mutants m E. coli DNA polymerase I that change lysine 
758 to alanine and arginine. The authors indicate that Basu et a/., Biochemistry 26, 1704-1709 (1987) 
implicate the same lysine 758 in dNTP binding. This was shown chemically; DNA polymerase I was 
covalently modified using pyridoxal S'-phosphate, a nucleotide analog, and lysine 758 was said to be the 

55 residue modified. 

Beese ef a/., Biochemistry 321: 14095-14101 (1993) describe the structure of a cocrystal of Klenow 
fragment of DNA polymerase I complexed with a dNTP or with pyrophosphate. The authors state that the 
dNTP binds adjacent to helix O. The authors make the following statements: (a) "In the Mg-dCTP complex. 
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sequ nee is made easier by having all four bands in a single lane. 

DuPont used to provide an instrum nt in which a different fluorescent marker was attached to each of 
four dideoxynucleoside triphosphates. Prober et a/., 238 Science 336, 1987. A single annealing step, a 
single polymerase reaction (containing each of the four labelled dideoxynucleoside triphosphates) and a 
s singl lane in the sequencing gel are required. The four different fluoresc nt markers in the DNA products 
ar detected separately as they are electrophoresed through the gel. 

Englert et a/., U.S. Pat. No. 4,707,235 (1987), describes a multichannel electrophoresis apparatus 
having a detection means, disposed substantially across the whole width of the gel. that can detect labelled 
DNA products as they migrate past the detector means in four separate lanes, and identifies the channel or 
io lane in which the sample is located. Preferably, radioisotopic labels are used. 

Inherent to procedures currently used for DNA sequence analysis is the necessity to separate either 
radioactively or fluorescently-labelled DNA products by a gel permeation procedure such as polyacrylamide 
gel electrophoresis, and then detect their locations relative to one another along the axis of movement 
through the gel. The accuracy of this procedure is determined in part by the uniformity of the signal in 
75 bands which have permeated approximately the same distance through the gel. Differences or variations in 
signal intensities between nearby bands create several problems. First, they decrease the sensitivity of the 
method, which is limited by the ability to detect the bands containing the weakest signals. Second, they 
create difficulties in determining whether a band with a weak signal is a true signal due to the incorporation 
of a chain terminating agent, or an artifact due to a pause site in the DNA where the polymerase has 
20 dissociated. Third, they decrease the accuracy in determining the DNA sequence between closely spaced 
bands since the strong signal of one band may mask the weak signal of its neighbor. See Tabor and 
Richardson, supra. 

Variation in band intensity can arise from an inherent property of most DNA polymerases. Most DNA 
polymerases discriminate against the chain terminating dideoxynucleotides used in DNA sequence analysis. 

25 T4 DNA polymerase discriminates against ddNTPs to such an extent that it cannot be used for DNA 
sequencing. E. coli DNA polymerase I, Taq, and Vent DNA polymerase also discriminate strongly against 
ddNTPs, each incorporating a ddNMP a thousand times slower than the corresponding dNTP. Tabor and 
Richardson supra, (both hereby incorporated by reference herein) have shown that T7 DNA polymerase lies 
at the other end of the spectrum, discriminating against ddNTPs only several fold . If a DNA polymerase 

30 discriminated against a ddNTP to the same extent at all sequences, this problem could be overcome by 
simply altering the ratio of ddNTPs to dNTPs. Such an approach has been used with £. coli DNA 
polymerase I and Taq DNA polymerase. However, the extent of discrimination varies with the adjacent DNA 
sequences, which leads to wide variation in the intensity of adjacent radioactive fragments. The intensity of 
specific fragments can vary by 50-fold for E. coli DNA polymerase 1 but only several fold for T7 DNA 

35 polymerase. Consequently, the intensity of bands on a DNA sequencing gel produced by T7 DNA 
polymerase are of similar intensity thus facilitating their detection and analysis by automated procedures. In 
addition, procedures that even further reduce the discrimination against dideoxynucleotides by T7 DNA 
polymerase are described such that it incorporates dideoxynucleotides equally as well as deoxynucleotides. 
These procedures and conditions also reduce but do not eliminate discrimination by other DNA poly- 

40 merases such as Klenow and Taq DNA polymerases. For example, the use of manganese in place of, or in 
addition to, magnesium in the reaction mixture may reduce or eliminate discrimination against dideox- 
ynucleotides. Under such conditions, T7 DNA polymerase does not differentiate between the two molecules 
whereas other DNA polymerases such as Klenow fragment, Taq and Vent still discriminate to some degree. 
For example, Klenow still discriminates against ddNTPs by as much as four-fold in the presence of 

45 manganese. More important, even though the overall degree of discrimination by such enzymes as Klenow 
and Taq DNA polymerases is reduced, the intensity of specific fragments can vary by much more than four 
fold due to high discrimination at certain sequences in DNA. These polymerases and procedures are now 
almost universally used in manual DNA sequencing (/.e., without aid of sequencing machines such as 
described above) and are extensively used in automated methods. The use of manganese and the lack of 

so discrimination against ddNTPs at all sites results in bands of uniform intensities, thus facilitating the reading 
of sequencing gels, either by manual or automated procedures. Moreover, the lack of discrimination enables 
the use of novel procedures for sequence analysis (T abor and Richardson, supra). A method based on this 
finding is provided to determine a DNA sequence in a single reaction that contains all four ddNTPs at 
different ratios, by measuring the relative intensity of each peak after gel electrophoresis. The authors 

55 indicate: 

The DNA polymerases of this invention do not discriminate significantly between dideoxy-nucleotide 
analogs and normal nucleotides. That is, the chance of incorporation of an analog is approximately the 
same as that of a normal nucleotide or at least incorporates the analog with at least 1/10 the efficiency that 
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cytos.n ,nt racts with His 881. whil th sugar interacts with Phe 764 [sic.. 762] (Figures 3 and 51 - rw 
However we conclude that the position of at least th dNMP moi ty of dNTP in the bfna^ctTptex s S 
l.ke y to be the same as in its catalytically relevant complex with primer-temp.ate DNA? STSEeJ Z 
enure binding s,te for the base of dNTP is formed by its Watson-Crick hydrogen bonding t I teTptoe 

f ■? ? StaCki " 9 °" *• 3 ' b3Se ° f «" primer *»* " is not "n'*e<y that th b^ing site for 
base m the binary complex is complet ly adventitious, consistent with our observation that it can bind in 
several locations dependent on conditions.- (d) "The binding site for dNTP observed in cryStfft! 
binary complex is he same as observed in solution studies. However, extrapolating from this binary 
complex to a model for the complex with dNTP in the presence of primer arTtemplate DNA requires 

DNA to bind in correct conformation." 

Joyce and Steitz. 63 Ann. Rev. Bloc. 777. 1994 (not admitted to be prior art to the present invention) 

?a S m er S thl e TT'^ ^ ° NA " d RNA P 0 *™ 38 " 5 - " indiC3teS *"» iTJw 
frather than the finger ) subdomain of DNA polymerase I - namely, the catalytic center, the binding site for 

. the 3 terminus of the pnmer. and the dNTP binding site. In HIV-1 reverse transcriptase it indicates that 
Z J°?,!L tha, J nfluence bindin 9 of DNA Polymerase inhibitors are around residues 67 - 70. It also states 
S LI S'Vl H? USefU ' conc,usions be drawn from the positions of the nucleotide base of sugar it 
L^l TL ^' in6 bina,y ^P' 6 " may 156 in,0 ™ a « v « * identifying contacts between Wenow 
fragment and the dNTP phosphate groups." In the preceding paragraph, it states "although a polymerase- 

* T P 6X Ca " * '° rmed ' sucn a complex is not catalytically competent." It further indicates 
that data would place the deoxyribose close to Phe 762" and that "Mutation of Tyr 766 [in Wenow 

2271 ! L ° h iS r6cat9<1 in tne fin9ers domain in *» vicinit y of 106 model-built template strand 
affects the discrimination between deoxy and dideoxy nucleotide substrates . . ." However, it also states "In 

9m J nt " mutations tnat have been found to affect the binding of dNTP in the ternary complex (as 
renewed in are located on one side of the polymerase cleft within or close to the finaers 

subdomain. Posit,ons identified thus far encompass the N terminus of helix Q (Arg 841 and Asn 845) the 
exposed face of helix 0 (Tyr 766. Phe 762. and Arg 754). and neighboring residues closer to the catalytic 
center (Asp 705 and Glu 710) ... An advantage of the kinetic approach is that it probes the ternary 
complex; however, as discussed above, it is impossible, in the absence of other structural evidence to 
distinguish direct effects from those mediated via template interactions. Moreover, the side chains listed 
above encompass an area much larger than the dNTP molecule and therefore cannot all be in direct 
contact with .t. Since the region of Klenow fragment implicated by these studies is thought to make 
extensive contact with the template strand, a reasonable interpretation is that a subset of the residues 
mentioned above are in direct contact with the dNTP. while the remainder bind the template DNA " 

Pellet.er ef al., 264 Science 1891. and Sawaya ef a/.. 264 Science 1930. 1994 (not admitted to be 
pnor art to the pending claims) in contrast, indicate that residues 271 - 274 in helices M-N of PoliJ (which 
are analogous to helices J-K of Klenow) "perform a common function, nucleotide discrimination " 

Sousa et al. 364 Nature 593. 1993 (not admitted to be prior art to the pending claims) describe the 
three dimenaonai structure of T7 RNA polymerase and its homology to E. coli DNA polymerase I They 
state that their observations suggest the "C-terminal elements of KF [Klenow fragment] (b-strand 14 
[residues 916 to 928] and the C terminal) contact the deoxyribose moiety of the dNTP during polymeriza- 
tion to discriminate between rNTP and dNTP substrates." 

Dideoxynucleosides. such as dideoxythymidine, are potent inhi*^r S of T7 phage growth. Experiments 
indicate that the inhibition of DNA synthesis is a consequence of the incorporation of the dideoxynucleotide 
into i T7 DNA Dideoxynucleosides are not inhibitory to uninfected r. coli. We do not know the explanation 
for the lack of inhibition of £. coli DNA synthesis but it could be explained by cellular uptake, a high level of 
discrimination against their incorporation by £ coli DNA polymerase III. inefficient phosphorylation to a 
triphosphate, or efficient removal. In any case we find that T7 mutant phage arise that can yield normal 
plaques on agar plates containing dideoxynucleosides with a frequency of approximately 10"3 The location 
of many of these mutations are shown in Fig. i. They reside within the gene 5 protein. The mutant gene 5 
proteins are more discriminatory (by a few fold) against ddNTPs than the native gene 5 protein Some 
members of this class of mutants may delineate the region of the polymerase that is important in 
recognition of the ribose moiety of the dNTP. 

It is important to note that the mutations obtained by this selection using dideoxynucleosides are based 
on an alteration of the region of the gene 5 protein that recognizes the ribose moiety. However it is 
possible that such mutations will also have dramatic effects on other nucleotide analogs. In addition it is 
possible to use the same procedure to select for other T7 mutants that discriminate strongly against other 
nucleotide analogs on the basis of the growth of phage in the presence of the other analogs 
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Ref rring to Table I, various DR mutants ar indicated with the amino acid substitution noted in th 
table. The amino acid substitution is further characterized on the right hand side of the table. The location of 
thes mutants is shown in FIG. 1 throughout the palm and finger regions of T7 DNA polymerase. Th 
thumb region is a flexibl region that interacts with minor groove of product duplex DNA; the palm region is 
th catalytic center, the binding site for the 3' nd of the primer and contributes to dNTP binding; th finger 
regions interact with ss template close to the site of synthesis and contribute to dNTP binding. These 
mutants are highly scattered throughout the polymerase and all have a relatively minor effect on 
incorporation of dideoxynucleotides, decreasing the ability to incorporate a dideoxynucleotide by only 5-20 
fold. In addition, some of these mutants are located in regions of comparative non-homology to DNA 
polymerase I. Thus, they do not provide an indication of the location of sites in other Pol I enzymes 
involved in ddNTP discrimination. 



Table 1 



Summary of T7 DNA Polymerase Dideoxy-Resistant Mutants 


Mutant 


No. 


Modification 






isolates 






DR1 


1 


Ala 425-* Thr 


Hydrophobic-* Polar 


DR2 


1 


Phe 434— Ser 


Hydrophobic— Polar 






Gly 442- Ser 


Hydrophobic— Polar 


DR3 


1 


Vai 443- lie 


Hydrophobic— Hydrophobic 


DR4 


2 


Arg 444— His 


Strong basic— Weak basic 


DR5 


1 


Arg 444— Cys 


Strong basic— Neutral, polar 


DR6 


8 


Ser 477- Phe- 


Polar— Hydrophobic 


DR7 


4 


Asp 504— Asn 


Basic— Neutral 


DR8 


2 


Ala 513- Thr 


Hydrophobic— Polar 


DR9 


2 


Thr 51 7- lie 


Polar— Hydrophobic 


DR10 


1 


Ala 532- Ser 


Hydrophobic— Polar 


DR11 


1 


Arg 566— Cys 


Strong basic— Neutral, polar 


DR12 


1 


Ala 61 9- Thr 


Hydrophobic— Polar 


DR13 


3 


Ala 700- Thr 


Hydrophobic— Polar 



Summary 

7 Hydrophobic— Polar 
3 Strong basic— Neutral, polar or weak basic 
2 Polar— Hydrophobic 
1 Hydrophobic— Hydrophobic 
In vitro Mutagenesis 

in vitro mutagenesis of the cloned gene 5 of T7 was used to construct gene 5 proteins in which 
different regions of E. coli DNA polymerase I were substituted for the analogous or homologous regions in 
T7 gene 5 protein. As discussed, we were particularly interested in determining the ability of these enzymes 
to incorporate nucleotide analogs and the extent to which they discriminate against these analogs. Referring 
to FIG. 2. the regions within which hybrids between T7 DNA polymerase and E. coli DNA polymerase I 
were made are shown as marked as regions A-E. 

Referring to FIG. 3. Applicant has determined that region C provides a riboselectivity region having a 
significantly greater effect than other regions in the polymerase. Some of these other regions are 
specifically indicated in FIG. 3. 

Referring to FIGS. 4-6 (and Table 2) , it was determined that substitution of amino acids in this region 
allowed conversion of the riboselectivity of a polymerase from a E. coli DNA pol l-type to a T7 DNA 
polymerase type and vice versa. Thus, by targeted mutagenesis of this region of poll-type polymerases 
the riboselectivity of a polymerase can be significantly altered. The level of effect is at least 50-100 fold, 
and generally more than 500-fold. 
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PNA Polymerases 

DNA polymerases useful in this invention include those belonging to the class of homologous 
polymerases t rmed "Pol l-type DNA polymerases" including T7-type DNA polym ras s. the larg 
fragment of £ coil DNA polymerase l f and Taq polymerase. 

Dr A polym rase useful in this invention include those belonging to a class of homologous polymerases 
including T7 type DNA polymerases (such as T7, T3, 01, 011, H W31, gh-1, Y, A1122. or SP6). By 
homologous polymerases is meant an enzyme such as those described by Delarue et a/.. Protein 
Engineering 3. 461-467 (1990), that presents an alignment of the Pol I family of DNA polymerases. It also 
presents the alignment of 
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Table 2. 



10 



15 



a t of ? omam ^changes between E. coli DNA polymerase I, T7 DNA polymerase 

and Taq DNA polymerase within helix 0 on discrimination against ddNTPs. The sequence of 
the three polymerases are shown at the top, with the number of the first residue indicated 
Be ow the consensus sequence for these three polymerases the mutants characterized in T7 
DNA polymerase (T7), E. coli DNA polymerase I (Pol) and Taq DNA polymerase (Tag) are 
shown, w,th the mutated residues underlined/Each mutant was tested for its relative rate of 
incorporation of ddNMP to dNMP by SDS activity gel analysis as described in Examole 2 
and mdicated on the right. Mutants T7 C-T8, Pol I C-K6, and Taq C-Q5 were purified aW 
with the wild-type proteins for further analysis. * 



Enzyme 



Sequence 



ddNTP Discrimination 



20 



25 



30 



35 



40 



45 



Pol I 754 
Taq 658 
T7 517 
Consensus 

T7 WT 
T7 C-T2 
T7 C-T3 
T7 C-T4 
T7 OT5 
T7 C-T6 
T7 C-T7 
T7 C-T8 

Pol I WT 

Pol I OKI 

Pol I C-K2 

Pol I C-K3 

Pol I C-K4 

Pol I C-K5 

Pol I C-K6 

Taq WT 
Taq C-Ql 
Taq C-Q2 
Taq C-Q3 
Taq C-Q4 
Tqa C-Q5 



R R S A K A I 

R R A A K T I 

R D N A K T F 

R A K 



N F G L I Y G 

N F G V L Y G 

I Y G F L Y G 

G Y G 



RDNAKTFI Y- G F L Y G 



R E_§ A K A I N F G 
R fi__S A K T F I Y G 
R D N A K A I N F G 
R D N A K A_I I Y G 
R D N A K T F N_F G 
RDNAKTFHYG 



Il-L V G 

F L Y G 

F L Y G 

F L Y G 

F L Y G 

F L Y G 



RDNAKTFIFGFLYG 



R 
R 
R 
R 
R 
R 
R 



R S 

J2_H 
R S 
R S 
R S 
R S 
R S 



R R A A 

RO A 

R R A A 

R R A A 

R R A A 

R R A A 



INF 
F I Y 



T F I Y 

T F N F 

A I I Y 

A I I F 

A I N X 



L I 
L I 



I N F G 
I N F G 
F I Y G 
I I_Y G 
I I F G 
I N Y G 



G 
G 
G 
G 
G 
G 
G 

G 
G 
G 
G 
G 
G 



High 
High 
Low 



Low 

High 

Low 

High 

Low 

High 

Low 

High 

High 

Low 

Low 

High 

Low 

High 

Low 

High 

High 

Low 

Low 

High 

Low 



Specificity residue 



50 



55 



conserved sequence motifs from six families of polymerases: DNA polymerases from Pol I, Pol alpha and 
Pol beta families. DNA-dependent RNA polymerases, reverse transcriptases, and RNA-dependent RNA 
polymerases; their results suggest a few residues are conserved between all polymerases According to 
their alignment (Figure 3, page 463), the selectivity residue identified herein (phenylalanine 762 in E coli 
DNA polymerase I) is in "Motif ET. In addition to the pol I family of DNA polymerases. Motif B is found in 
the pol alpha family of DNA polymerases and the T7 family of DNA-dependent RNA polymerases- thus this 
alignment strongly suggests the residues one should mutate in these other families of polymerases that 
include T4 DNA polymerase. Herpes DNA polymerase, 029 DNA polymerase. Vent DNA polymerase and 
Pfu DNA polymerase. 
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In addition, Joyce, Current Opinion in Structural Biology 1. 123-129 (1991) compar s th DNA 
sequences from many polym ras s, and suggests that a small numb r of important active-sit r sidu s ar 
conserved. In particular, there is a discussion of th similarities between polymerases of the pol I family 
(T7, pol I, Taq) and of the pol alpha family (T4, 029. Herpes). In Figure 1 (page 124) Joyce indicates the 
positi ns of 5 invariant residues found in these two families; th y include lysine 758, tyrosine 766. and 
glycine 767; these are all very close to the selectivity residue identified herein, phenylalanine 762. 

These polymerases are used in a DNA sequencing reaction under conditions in which they preferably 
produce nearby bands of approximately uniform intensity (with about a 1.5- to 2.0-fold variation in intensity) 
when the DNA products of the sequencing reaction are run in a gel. By nearby is meant to include bands 
representing DNA products of molecular weight differing by as much as 6000, /.e., 20 bases. The actual 
value of this intensity will decrease along the length of the gel, as described in Tahor and Richardson, 
supra. Band intensity reflects the number of DNA products within a certain band. Labels such as 
fluorophores or radioisotopes, are used to produce a readily detectable band of intensity reflective of the 
number of such DNA products. Thus, in this invention, nearby bands derived from one sequencing reaction 
with one chain terminating agent have approximately the same number of DNA products and thus a uniform 
band intensity. The sequencing conditions include incubation of the polymerase in the presence of specific 
divalent or trivalent cations such as manganese (II and III), ferrous and ferric ions; monovalent and divalent 
cations which have no detectable effect, or are detrimental to DNA synthesis, include: K, Na, Ba, Be, Ca, 
Cc, Cr, Co, Cu, Ni, Si and Zn. The anion is unimportant, for example, chloride, acetate, and sulfate are 
suitable. Under these conditions the requirement for chain terminating agents, such as dideoxynucleosides. 
may be lessened by several-fold for enzymes for this invention. A chelator may also be provided in this 
solution in order to help regulate the concentration of available divalent metal ions. For example, citrate or 
isocitrate may be provided. These chelates are thought to maintain the level of, for example, free 
manganese ions at a concentration of between 10 and 100 uM over a wide range of manganese 
concentrations. That is, the chelator acts as a buffer. 

The DNA polymerases of this invention do not discriminate significantly between dideoxynucleotide 
analogs and deoxynucleotides along the length of the DNA template. That is. these polymerases are unable 
to discriminate significantly between a nucleotide that has a 3' hydroxyl group versus one that does not 
(i.e., has two hydrogens at the 3' position of the ribose). However, these polymerases may discriminate 
against modifications at other positions on the nucleosides, even in the presence of manganese or iron. For 
example, the polymerases may discriminate against some dideoxynucleotide analogs which have fluores- 
cent groups attached compared to deoxynucleotides. However, the polymerases do not discriminate to a 
different extent at neighboring, or nearby nucleotides, on the basis of the presence or absence of the 
modification to the dideoxynucleotide. Thus, while they discriminate strongly against these analogs, 
requiring higher concentrations for a DNA sequencing reaction compared to unmodified dideox- 
ynucleosides. the intensity of nearby bands will still be uniform. 

Thus, the polymerases of this invention provide a uniform efficiency of incorporation of chain terminat- 
ing agents, even if they discriminate against overall incorporation. In addition, other polymerases of this 
invention will give more uniform bands with fluorescent ddNTPs than the corresponding naturally occuring 
enzyme, although not as uniform as with unlabels or radioactively labelled ddNTPs. 

Chain terminating agents useful in this invention include dideoxynucleosides having a 2', 3'dideoxy 
structure. Other agents useful in the invention are those able to specifically terminate a DNA sequencing 
reaction at a specific base, and are not discriminated against by the polymerase under the above 
conditions. 

In order to determine whether any particular DNA polymerase, in combination with any particular chain 
terminating agent, or other component of a sequencing reaction mixture, is useful in this invention, a 
standard sequencing reaction is performed, as described in Tabor and Richardson, supra, and the extent of 
band formation, and the uniformity of nearby bands in a sequencing gel, reviewed. If the polymerase 
reaction does not extend the primer by at least 20 bases, it is not suitable under the conditions used. 
Adjacent band uniformity within a two-fold or less range is useful in this invention, preferably the uniformity 
is within a 1.0-1.5 fold range. Similarly, determination of optimum cation concentration, or of other potential 
cations useful in the invention, is determined by use of this sequencing reaction under various conditions. 
For example, cations are tested in ranges from 0.005-100 mM. An example of such an experiment follows: 

The ability to incorporate a given ddNMP compared to the corresponding dNMP for any one enzyme is 
measured as the ratio of ddNTP to dNTP necessary to allow DNA synthesis that terminates in a fixed 
range, detected as producing bands of no greater than a fixed molecular weight. That is. the bands 
produced in the reaction end within a specified range in the sequencing gel. Thus, if one enzyme 
discriminates 1000-fold greater against a given ddNTP compared to another enzyme, a 1000-fold higher 
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ratio of ddNTP to dNTP will b necessary to obtain bands terminating at the corr spending sites in th 
same rang of th gel. 

Exonuclease Activity 

The DNA polymerases of the invention preferably have less than 50%, preferably less than 1% and 
most preferably less than 0.1%. of the normal or naturally associated level of exonuclease activity (amount 
of activity per polymerase molecule). By normal or naturally associated level is meant the exonuclease 
activity of e.g.. an unmodified T7-type polymerase. Normally the associated activity is about 5.000 units of 
exonuclease activity per mg of polymerase, measured as described below by a modification of the 
procedure of Chase et al. (249 J. Biol. Cne/n. 4545, 1974). Exonucleases increase the fidelity of DNA 
synthesis by excising any newly synthesized bases which are incorrectly basepaired to the template. Such 
associated exonuclease activities can be detrimental to the quality of DNA sequencing reactions. They raise 
the minimal required concentration of nucleotide precursors which must be added to the reaction since 
when the nucleotide concentration falls, the polymerase activity slows to a rate comparable with the 
exonuclease activity, resulting in no net DNA synthesis, or even degradation of the synthesized DNA 

More importantly, associated exonuclease activity may cause a DNA polymerase to idle at regions in 
the template with secondary structure impediments. When a polymerase approaches such a structure its 
rate of synthesis decreases as it attempts to pass. An associated exonuclease will excise the newly 
synthesized DNA when the polymerase stalls. As a consequence numerous cycles of synthesis and 
excision will occur. This may result in the polymerase eventually synthesizing past the hairpin (with no 
detriment to the quality of the sequencing reaction); or the polymerase may dissociate from the synthesized 
strand (resulting in an artifactual band at the same position in all four sequencing reactions); or, a chain 
terminating agent may be incorporated at a high frequency and produce a wide variability in the intensity of 
different fragments in a sequencing gel. This happens because the frequency of incorporation of a chain 
terminating agent at any given site increases with the number of opportunities the polymerase has to 
incorporate the chain terminating nucleotide. 

An ideal sequencing reaction will produce fragments that give bands of uniform intensity throughout the 
gel. This is essential for obtaining the optimal exposure of the X-ray film for every radioactive fragment If 
there is variable intensity of radioactive bands, then fainter bands may go undetected. To obtain uniform 
radioactive intensity of all fragments, the DNA polymerase should spend the same interval of time at each 
position on the DNA, showing no preference for either the addition or removal of nucleotides at any given 
site. This occurs if the DNA polymerase lacks any associated exonuclease, so that it will have only one 
opportunity to incorporate a chain terminating nucleotide at each position along the template. 

Short Primers 

The DNA polymerase of the invention is preferably able to utilize primers of 10 bases or less, as well as 
longer ones, most preferably of 4-20 bases e.g.. 6 bases (which can ^ used in groups of three to form an 
equivalent of an 18-mer). The ability to utilize short primers offers a number of important advantages to 
DNA sequencing. The shorter primers are less expensive and easier to synthesize than the usual 17-mer 
primers. They also anneal faster to complementary sites on a DNA template, thus making the sequencing 
reaction faster. Further, the ability to utilize small (e.g., six or seven base) oligonucleotide primers for DNA 
sequencing permits strategies not otherwise possible for sequencing ,oiig DNA fragments. For example, a 
kit containing 80-4000 random hexamers could be generated, none of which are complementary to any 
sites in the cloning vector. Statistically, one of the 80 hexamer sequences will occur an average of every 50 
bases along the DNA fragment to be sequenced. The determination of a sequence of 3000 bases would 
require only five sequencing cycles. First, a "universal" primer (e.g.. New England Biolabs #1211, sequence 
5' GTAAAACGAACGGCCAGT 3') would be used to sequence about 600 bases at one end of the insert. 
Using the results from this sequencing reaction, a new primer would be picked from the kit homologous to a 
region near the end of the determined sequence. In the second cycle, the sequence of the next 600 bases 
would be determined using this primer. Repetition of this process five times would determine the complete 
sequence of the 3000 bases, without necessitating any subcloning, and without the chemical synthesis of 
any new oligonucleotide primers. The use of such short primers may be enhanced by including gene 2.5 
and gene 4 protein of T7 in the sequencing reaction. 



16 



EP 0 655 506 A1 



20 



25 



30 



35 



40 



45 



50 



55 



In vitro Mutagenesis 

Mutagen sis of the polymerase genes was carried out using standard PCR techniqu s (see below). 
5 Discrimination against ddNTPs 

4 fold ^ZTtmrtT^t™ 98 th l° nly diVa,6nt Cation ' 77 DNA Phrase discriminates about 3 - 

In the presence of manganese discrimination by T7 DNA polymerase and E. coll DNA polymerase I am 
reduced; with T7 DNA polymerase, it is reduced from 3.7 to 1. and with £ coll DNA pdSSZTl ifS 
reduced from 550 to 3.9 (for ddATP). Applicant is the first to provide a DNA iXSTSTll thl 

aTeSr,eirr:'r ions r * e on,y c,iva,9n, cauons has a pr< * ess,vi,y * - *2^?£S l z 

V^SZ^L ° m 3 9iVe " Primer Mor * dissocia «"9 from the primer-template; Reverse 

transcr,ptas u has a processivity by this definition of about 150 - 200. and T7 DNA polymerase haT! 

co^TZS^T^ m "is* d , iscrimina,es ,ess 100 f0,d a9ainst ln a 

contras . most of the known DNA polymerases, such a Taq that have a processivity less than 100 
d.scnm.nate more than 100 fold gainst incorporation of a ddNMP. ^ssiviry less tnan 100 

Previously it was believed that whereas polymerases with high processivity such as T7 DNA poly- 
merase rema.n bound to a primer-template for up to several minutes, polymerases with low pmcUSk 

ona hLlT NA ""T" ' CyC ' e fr0m ° ne Primer temp ' a,e 10 anoth9r ™* te » sLn^Xer 
one hundred tames more frequently). See for example Tabor.ef al. J. Biol. Chem. 262 16212-16223 EiS? 

2 LT^T" - S advanta9eous for DNA fencing, as in reducing the background due to terminations 
not from d.deoxy mcorporation. the slow cycling time is a disadvantage. For example, if the polymerase 

there .s a large excess of polymerase present. On the other hand, with a polymerase that cycles raoidlvT 

Z^LZlT T P °' ymeraSe SlnCe 3 Sin9 ' e 6n2yme m0,eCu,e "»^* "W^SS Zer y s 
«Zh^ h i« Sequencin 9 reaction - «* 9iven primer end will have the opportunity to be 

many erent poymerase molecu,es • decreasing the chance of stron9 SP ^ C S, °P S «™ 

,o nHoT!^ " '! th3t 3 po,ymerase c V cles rapidly also incorporates ddNMPs efficiently, in order 
to give bands of uniform intensity and allow one to use less ddNTPs. It is also preferred that such a 

!f hZcT T °l n ° exonuclease activit V- that one add pyrophosphatase to prevent degradation 
of bands by pyrophosphorolysis. ^ 

It is also preferred that one be able to carry out DNA sequencing reactions with magnesium as the only 
™ J! Z l - the abSen ° e °' man 9 anese )- Rrs t. Polymerases tend to be less active with manganese 
m^qTv o ma9 ? S ' Um <Se " ,0f 8Xamp,e Tab0r 30(1 Rich ^dson. Proc. Natl. Acad. Sci. USA 86. 4076- 
Ze !!L r m T Polymerases tend to be active over wide ranges of magnesium concentrations. 
m a h I P " ° ptimum f" 3 ^ 30689 concentration required in most cases for optimum activity 
ioicT hhLtp«° P 7 ma " 9ane u se concentrations there is much less effect on reduction of discrimination 
against ddNTPs as at much h.gher concentrations, where the polymerase is much less active Third 

w a h^ e Ht'V 0t J S t C ° nVenient 3 m9ta ' i0 " t0 have in a kit: il readi| y ,orms Palpitates, particularly at 
higher pH s. Fourth, .t is not clear whether manganese will be effective as a metal ion for reducing 
discrimination against ddNTPs with any thermophilic polymerase (i.e.. at higher temperatures) 

am* "° r t0 the PfeSent invention - we stated (Tabor and Richardson Proc. Natl. Acad. Sci USA 86 4076- 
4080 (1 989)) that there was a correlation between discrimination and processivity: 

"DNA polymerase I has a low processivity. dissociating after the incorporation of less than 10 
nucleotides. There is a strong correlation between the frequency at which the enzyme dissociates from a 
site during DNA synthesis in the absence of ddNTPs, and the extent of discrimination against the 
incorporate of a ddNMP at that site (unpublished results). This suggests that DNA polymerase I 
mcorporates dNMPs ; and ddNMPs at similar rates during processive synthesis; however, when synthesis is 
nonprocessive. dNMPs are incorporated preferentially over ddNMPs. This model could account for the 
greater variabiliy in ddNMP incoporation byDNA polymerase I complared to T7 DNA polymerase, since the 
latter has a processivity two orders of magnitude greater than the former." [Citations omitted ] 

Thus the results of this invention with E. coli DNA polymerase I and Taq DNA polymerase are 
surprising, since we find no evidence that the mutants described herein do increase the processivity of the 
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mutant enzymes. 
Thermophilic polymerases 

Thermophilic polymerases that discriminate against a ddNTP by less than a factor of 100 are 
particularly us ful in this invention. In, addition, those that discriminate against a ddNTP by less than a 
factor of 100 in the presence of magnesium as the only divalent cation and preferably cycle from one 
primer-template to another more than once per second are useful. Thermophilic polymerases are defined as 
polymerases that have optimum DNA polymerase activity in a 15 min. reaction at a temperature above 
60 *C. 

Uniform band intensities 

Even though manganese reduces the discrimination of Klenow fragment against ddATP from 550 to 3.9 
fold Tabor and Richardson (Proc. Natl. Acad. ScL USA 86, 4076-4080 (1989)) show that there is still wide 
variability in the intensity of individual bands (see Figure 2 /d). Thus, apart from T7 DNA polymerase, this 
invention is the first to provide polymerases that cycle rapidly such as Klenow fragment and those that are 
derived from thermophilic organisms to produce bands that have uniform intensifies, even in the presence 
of magnesium as the only divalent cation, conditions that are preferable for the activity of most polymerases 
(see above). Enzymes which cycle rapidly can be determined by methods known in the art as described 
below. 

Specific Polymerases 

From the above information it is possible to readily, make the following polymerases which will have the 
desired properties discussed above. Each of these polymerases can be used for sequencing procedures if 
the level of exonuclease is low enough and the activity of the polymerase is sufficient (both of which are 
well known in the art). See Braithwaite and Ito, supra for reference to each amino acid site. 

1. Pol I family 

E. coli DNA polymerase I with altered Phe762 (altered means replaced with e.g.. Tyr, or an equivalent 
amino acid to give the non-discriminatory property). 

Streptococcus pneumoniae DNA polymerase I with altered Phe7l i. 
Therm us aquaticus DNA polymerase I with altered Phe667. 
Thermus ftavus DNA polymerase I with altered Phe666. 
Bacteriophage T5 DNA polymerase with altered Phe570. 
Bacteriophage Spo 1 DNA polymerase with altered Leu526. 
Bacteriophage Spo 2 DNA polymerase with altered Phe690. 

Mitochondrial DNA polymerase with natural Tyr753 or altered at this site without reducing the non- 
discriminatory activity. Such a polymerase has not previously been used for DNA sequencing. Applicant 
believes that it will be useful in such a procedure because of its expected low level of ddNTP discrimina- 
tion. If needed, it can be modified to reduce exonuclease activity associated with the polymerase activity. 

2. Polymerase alpha family (also called polymerase II family) 

Delarue et al, Protein Engineering 3. 461-467 (1990) show that the two families of polymerases 
(polymerase I family and polymerase alpha family) share three common motifs. The region they call "Motif 
B" contains the residue we have identified as responsible for specificity for the dideoxyribose moiety. This 
region is characterized by the sequence K Ni N 2 N 3 N* N 5 N 6 N 7 Y G in the polymerase I family, where 
N* is the specificity residue: if N* is a phenylalanine there is high discrimination, if N 4 is tyrosine there is 
low discrimination. In the polymerase alpha family, the sequence is K N, N 2 N 3 N* N 5 N $ Y G (there is one 
less base between the conserved residues). We predict therefore that just as with polymerase I type 
enzymes, changes to the residue(s) in this motif (betwen the lysine (K) and the tyrosine (Y)) will reduce the 
discrimination of these polymerases to ddNTPs. These residues are as follows: 
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Escherichia coli DNA polymerase II 



PRDl DNA polymerase 



0 29 DNA polymerase 



M2 DNA polymerase 



T4 DNA polymerase 



Pyrococcus furiosus DNA polymerase 



Sulfolobus solfataricus DNA polymerase 



Human DNA polymerase alpha 



S. cerevisiae DNA polymerase I (alpha) 



5. pombe DNA polymerase I (alpha) 



Ue494-Phe499 

Leu341-Ser346 

Leu384-Leu389 

Leu381-Leu386 

De558-Leu563 



Thermuococcus litoralis DNA polymerase (Vent) Leu492-Tyr497 



Leu489-Phe494 
Val604-Thr609 
Leu951-His956 
Leu945-His950 
Leu931-His936 



Drosophila melanogaster DNA polymerase alpha Leu960-His965 
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Trypanosoma brucei DNA polymerase aipha 
Human DNA polymerase delta 
Bovine DNA polymerase delta 
S. cerevisiae DNA polymerase HI (delta) 
S. pombe DNA polymerase IH (delta) 
Plasmodium falciparum DNA polymerase delta 
5. cerevisiae DNA polymerase II (epsilon) 
£ cerevisiae DNA polymerase Rev3 
Herpes Simplex virus type 1 DNA polymerase 
Equine herpes virus type 1 DNA polymerase 
Varicella-Zoster virus DNA polymerase 
Epstein-Barr vims DNa polymerase 
Herpesvirus saimiri DNA polymerase 
Human cytomegalovirus DNA polymerase 
Murine cytomegalovirus DNa polymerase 
Human herpes vinii type 6 DNA polymerase 
Channel Catfish virus DNA polymerase 
Chlorella virus DNA polymerase 
Fowlpox virus DNA polymerase 
Vaccinia virus DNA polymerase 



Leu845-His850 

Val695-Val700 

Val694-Val699 

Ile702-Val707 

VaI681-Val686 

De692-Val697 

Val82-Phe830 

LeuI087-Thrl092 

Val812-Val817 

Val813-Val818 

VaI776-Val78I 

Cys682-VaI687 

Val671-Val676 

Val811-Phe816 

Val717-Phe722 

fle667-Val672 

De750-His755 

Qe586-Val59l 

ne648-Val653 

De637-VaI642 
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Choristoneura biennis DNA polymerase 



De669-Leu674 



Autographa californica nuclear polyhedrosis 



virus (AcMNPV) DNA polymerase 



Arg606-De611 



Lymantria dispar nuclear polyhedrosis virus 



DNA polymerase 



Adenovirus-2 DNA polymerase 



Adenovims-7 DNA polymerase 



Adenovirus- 12 DNA polymerase 



S-l maize DNA polymerase 



kalilo neurospora intermedia DNA polymerase 



pAI2 Ascobolus immersus DNA polymerase 
pCLKl Claviceps purpurea DNA polymerase 



Maranhar neurospora crassa DNA polymerase 



pEM Agaricus bitorquis DNA polymerase 



pGKLl Kluyveromyces lactis DNA polymerase 
pGKL2 Kluyveromyces lactis DNA polymerase 
pSKL Saccaromyces kluyveri DNA polymerase 



Arg624-Ile629 

Leu696-Leu701 

Leu762-Leu767 

Leu694-Leu699 

Leu618-Leu623 

Leu776-Leu777 

Leu951-Leu956 

Leu831-Leu836 

Leu752-Leu757 

Leu573-Leu578 

Ile785-Leu790 

He770-Gly776 

Ile775-Gty781 



Examples 

The following are examples of methods for determining the processivity and cycle times for various 
polymerases. Also provided are examples for determining the level of discrimination by a polymerase, and 
other methods useful in this invention. 
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Example 1. Mutagenesis of DNA polymerase q n sand overproduction of mutant DNA polym rases 

Standard techniques ar used for the cloning and expression of mutant DNA polym rase genes. The 
genes for the large fragment of E. coli DNA polymerase I (Klenow fragm nt) and the larg fragment of Taq 
DNA polymerase (KlenTaq or _Taq DNA polymerase, see Barnes 112 Gene 29. 1992 or Stoffel fragment, 
s e Lawyer et al. 2 PCR Methods Appl 275. 1993). the starting materials for the generation of mutants in E. 
coli DNA polymerase I and Taq DNA polymerase, are expressed under the control of the T7 RNA 
polymerase promoter. The gene for the _28 amino acid deletion of T7 DNA polymerase (see Tabor and 
Richardson 264, J. Biol. Chem. 6447, 1989), the starting material for the generation of mutants in T7 DNA 
polymerase, is expressed under the control of the lac promoter in a strain that produces E. coli thioredoxin, 
a necessary factor for processive DNA synthesis by T7 DNA polymerase (Tabor and Richardson, supra). 
The gene for Taq DNA polymerase mutant F667Y is transferred from the gene that produces _Taq DNA 
polymerase to the gene that produces the full length Taq DNA polymerase by standard techniques using 
PCR and restriction digestion followed by ligation. 

Mutagenesis to construct the mutant DNA polymerases is carried out using standard mutagenesis 
techniques by PCR similar to the method described by Sarkar and Sommer 8 BloTechniques 404. 1990. 
To construct hybrids in which more than four amino acid residues are being exchanged, two hybrid primers 
are constructed whereby PCR is first carried out on the donor DNA, and then that product is used for PCR 
on the recipient DNA, generating the hybrid molecule. For the construction of hybrids in which the 
exchange of domains is four amino acid residues or less, single PCR primers are synthesized that contain 
the entire region to be transferred as well as the proper flanking sequences of the recipient, and that primer 
is used to construct the hybrid molecule directly. 

Overproduction of the mutant DNA polymerases is carried out using standard techniques (see for 
example Current Protocols in Molecular Biology. Ausubel et al. eds.. Chapter 16. 1994). Mutant proteins are 
purified by standard procedures including ion exchange chromatography. For the purification of E. coli DNA 
polymerase I mutants, see for example Joyce and Grindley 80 Proc. Natl. Acad. ScL 1830, 1983. For the 
purification of Taq DNA polymerase mutants, see for example Engelke et al. 191 Analytical Biochemistry 
396. 1990. For the purification of T7 DNA polymerase mutants, see for example Tabor and Richardson 264, 
J. Biol. Chem. 6447. 1989. Polymerase specific activities of each of the purified mutant proteins are 
determined by standard procedures described in these references. 

Example 2. Rapid screen of DNA polymerases for mutants that are improved in their efficiency of 
incorporating a dideoxynucleotide relative to a deoxynucleotide 

Mutant DNA polymerases are screened for their ability to incorporate dideoxynucleotides by SDS 
activity gel analysis. The procedure is a modification of that described by Spanos and HUbscher 91 
Methods in Enzymology 263. 1983 and Karawya et al. 135 Analytical Biochemistry 318, 1983. Briefly, 10 
ml of cells are induced for 4 to 6 hours and then pelleted. The cell pellet resuspended in 0.3 ml 25 mM 
Tris-HCI, pH 7.0, 5 mM EDTA. 20 ul of the resuspended cells are mixea with 40 ul of a solution of 1% 
SDS (sodium dodecyl sulfate). 2% mercaptoethanol, 30 % glycerol. 0.04% bromphenol blue, and 100 mM 
Tris«HCI. pH 6.8. The mixtures are incubated at 37 *C for 5 min. and then 20 ul aliquots are loaded in 
duplicate onto two SDS polyacrylamide gels. The SDS polyacrylamide gels consist of a resolving gel that 
contains 8% polyacrylamide. 0.27% bisacrylamide. 190 mM Tris^HCI. pH 8.8. 0.05% SDS. and 25 ug/ml 
denatured salmon sperm DNA. and a stacking gel that consists cf 5% polyacrylamide, 0.17% 
bisacrylamide, 150 mM Tris^HCI, pH 6.8, and 0.1% SDS. The two gels are electrophoresed at 100 V for 13 
hr at a constant temperature of 13 *C in an electrophoresis buffer consisting of 190 mM Tris*HCI and pH 
8.8. 0.05% SDS. 

After electrophoresis, the gels are washed over 8 hr in 4 changes of 500 ml each of Renaturation Buffer 
(50 mM Tris.HCI. pH 7.5, 5 mM magnesium acetate, 1 mM dithiothreitol, 40 mM KCI, 400 ug/ml bovine 
serum albumin, 16% glycerol and 0.95 mM EDTA) at 4 *C. 

The renatured proteins are assayed for DNA polymerase activity by incubating each of the two gels in 6 
ml of Renaturation Buffer, 1.5 uM 4dNTPs. 4 ul of [a- 32 P]dATP (800 Ci/mmol, 10 mCi/ml), and 80 ug of 
purified thioredoxin. One of the mixtures also contains 30 uM ddTTP (a 20-fold molar excess over dTTP). 
The mixtures are incubated for 4 hr at 37 • C (70 * C for 2 hr for thermophilic DNA polymerases). 

After incubation the gels are washed for 8 hr against four changes of 5% trichloracetic acid and 1% 
sodium pyrophosphate. The gels are. then dried and autoradiographed. 

To determine whether a mutant DNA polymerase is discriminating more or less against ddTTP, th 
intensities of the radioactive bands are compared on the two gels that were incubated in the presence and 
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abs nc of ddTTP, and the ratio of the signal in th two bands for the unmodified DNA polym rase is 
compared with the ratio of the signai in the tw bands for each of th mutants. If a mutation r suits in a 
ONA polym rase being less discriminatory towards ddTTP, th n there will be a greater percentage 
decrease in radioactivity in the band in which ddTTP was present for th mutant DNA polym rase 

5 compared with th unmodified DNA polymerase. For example, under thes conditions the radioactive bands 
observed for cells containing induced E. coli DNA polymerase I or T7 DNA polymeras mutant Y526F are 
approximately the same intensity (within a factor of two) in reactions carried out in the presence versus the 
absence of ddTTP. In contrast, for cells containing induced E. coli DNA polymerase I mutant F762Y or T7 
DNA polymerase, the bands on the gel in which the reactions are carried out in the presence of ddTTP are 

10 less that 5% the intensity of the bands corresponding to the reactions carried out in the absence of ddTTP. 
This assay represents a rapid method of screening a large number of DNA polymerase mutants for 
their ability to discriminate against dideoxynucleotides. It can detect changes of at least 5-fold in the relative 
rate of discrimination. However, this assay should be followed up with purification of potentially interesting 
mutant DNA polymerases and more rigorous assays of the purified proteins similar to those described 

rs below in order to determine precisely the effect of each mutation on discrimination against dideox- 
ynucleotides. 

The following examples are methods for determining the processivity and cycle times for various 
polymerases, determining the level of discrimination against ddNTPs by a polymerase, determining the 
uniformity of bands generated by dideoxy-terminated fragments on a DNA sequencing gel, and using the 
20 DNA polymerases of this invention for DNA sequence analysis. 

Example 3. Preparation and purification of a single-stranded M13 DNA - 5' 32 P-labeled 40-mer primer 
complex 

25 The template is Ml 3 mGP1-2 single-stranded DNA, 9950 nucleotides in length, as described in U.S. 
Patent 4,795.699 (Rgure 9). The phage M13 mGPt-2 is deposited in ATCC as number 40303. M13 mGP1- 
2 single-stranded DNA is purified as described in Tabor et a/., 262 J. Biol. Chem. 16212, 1987. Briefly, the 
phage is purified through two CsCI gradient centrifugations, the CsCI is removed by dialysis, the DNA is 
removed from the phage by extraction with phenol and chloroform in the presence of 0.1% sodium dodecyl 

30 sulfate, and the extracted DNA is dialyzed extensively against 20 mM Tris-HCl, pH 7.5, 2 mM EDTA and 
stored at 4 *C. The concentration of M13 mGP1-2 single-stranded DNA is determined spec- 
trophotometrically using an extinction coefficient of 8.1 A26o units = 1 mg/ml, or 0.3 pmoles of M13 mGP1- 
2 template molecules per microliter. 

The primer is a synthetic 40-mer having the sequence 5* d- 

35 (TTTTCCCAGTCACGACGTTGTAAAACGACGGCCAGTGCCA)3' synthesized by standard procedures. It is 
complementary to Ml 3 mGP1-2 DNA at nucleotides 9031 to 8992 (see '699 patent supra for sequence). 
The primer is purified by ion exchange chromatography or denaturing polyacrylamide gel electrophoresis 
prior to end labeling. 

The primer is labeled and annealed to the te "Mate essentially as described in Tabor ef a/ (supra). The 
40 primer is 5' end labeled in a reaction mixture (15 ul) containing 40 mM Tris»HCI, pH 7.5, 10 mM MgCfe. 5 
mM dithiothreitol. 100 mM NaCI, 50 ug/ml BSA. 50 uCi [g^PJATP. 6000 Ci/mmol, 5 pmoles of primer, and 
10 units of T4 polynucleotide kinase prepared from the PseT1 mutant, which is deficient in phosphatase 
activity. The mixture is incubated at 37 *C for 15, min. followed by 70 *C for 15 min to inactivate the kinase. 
60 ul of single-stranded M13 mGP" 2 DNA (0.25 mg/ml), 6 ul of 1M NaCI and 3 ul of 0.2 M MgCb are 
45 added, and the mixture is slowly cooled from 70 * C to room temperature (about 20 - 25 • C) over a period of 
30 min. The mixture is then extracted once with a 1:1 mixture of phenol and chloroform, and after 
centrifugation for 30 sec in a microfuge, the aqueous phase (70 ul) is placed on a 1 ml column of 
Sepharose CL-6B equilibrated in 20 mM Tris«HCI, pH 7.5, 2 mM EDTA, and 100 mM NaCI. The labeled 
primer-template complex is eluted from the column with the same buffer used for equilibration; the labeled 
so complex elutes in the void volume. After elution, the complex is at a concentration of approximately 50 
ug/ml (0.015 pmoles of molecules per ul) with a specific activity of approximately 200,000 cpm/ul. 

Example 4. Determination of the processivity of a DNA polymerase by dilution test 

55 Processivity is determined by enzyme dilution essentially as described in Tabor et al (supra) and 
Tabor and Richardson, 84 Proc. Natl. Acad. Sci USA 4767, 1987. The reactions are carried out under the 
same conditions as in the extension/termination reactions used in DNA sequencing (Tabor and Richardson, 
supra), except that ddNTPs are omitted and the polymerase concentration is reduced in order to have an 
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excess of primer-template molecules over polymerase molecules in some of the r actions The prim r- 
templat consists of th singl 5* end-labeled primer ann aled to a single-stranded M13 DNA molecul as 
described in Exampl 3. 

The xtension reaction mixtures are pr pared substantially as describ d in Tabor et al (supra) Each 
r action mixture (18 ul) contains 1.0 ul annealed ^P-labeled primer-Mi 3 DNA as described in Example 3 
CU015 pmoles. -200.000 cpm), 40 mM Tris«HCI. pH 8.0, 5 mM MgCfe, 5 mM dithiothreitol. and 300 uM 
4dNTPs. The mixtures ar incubated at 37 -C for 1 min (70 •C for thermophilic DNA polymerases). The 
reactions are initiated by the addition of 2 ul aliquots of dilutions of the DNA polymerase being analyzed 
diluted in 20 mM Tris-HCI, pH 7.5. 10 mM 2-mercaptoethanol and 0.05% bovine serum albumin. The 
reaction mixtures are further incubated at 37* C (70' C for thermophilic DNA polymerases) for either 30 sec 
or 3 min. At the indicated times, 8 ul aJiquots are removed and added to either 8 ul of 90% formamide. 20 
mM EDTA. 0.05% bromphenol blue for denaturing polyacrylamide gel electrophoresis, or 2 ul of 100 mM 
EDTA, 2 % sodium dodecyl sulfate for alkaline agarose gel electrophoresis. 

The samples are analyzed by either denaturing polyacrylamide gel electrophoresis or aikaline agarose 
gel electrophoresis. Denaturing polyacrylamide gel electrophoresis is most suitable for analyzing poly- 
merases with an average processivity of less than 500 nucleotides, while alkaline agarose gel elec- 
trophoresis provides a more sensitive estimate of the processivity of DNA polymerases with an average 
processivity greater than 500 nucleotides; however, either method can be used successfully to determine 
the average processivity of any DNA polymerase. 

To determine the processivity by denaturing po!;,acrylamide gel electrophoresis, aliquots in formamide 
are heated at 90 'C for 2 min immediately prior to loading 6 ul of each sample onto a gel consisting of 8% 
polyacrylamide. 0.4% N.N'-methylenebisacrylamide. 7 M urea in 100 mM Tris» borate. pH 8.3. 1 mM EDTA. 
Electrophoresis is at 2000 Volts for 90 min (until the bromphenol blue has just run off the bottom of the 
gel). Suitable 5' ^P-end labeled molecular weight markers are also loaded onto the gel that allow the 
determination of fragment sizes 100 to 500 nucleotides in length. An example of such suitable markers are 
T7 Hpal fragments that have been dephosphorylated with alkaline phosphatase and then 5* ^P-end labeled 
using [g-^PJATP and T4 polynucleotide kinase using standard procedures (Sambrook ef a/., 1989, 
Molecular Cloning A Laboratory Manual, Cold Spring Harbor Laboratory, N.Y. pp. 6.20-6.21 and Ausubel et 
a/.. 1989. Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley Interscience, 
N.Y.). After electrophoresis, the gel is dried under vacuum, and autoradiographed. After autoradiography] 
the distribution of radioactively labeled fragments is determined by Phosphorimager analysis (Molecular 
Dynamics). 

Products are analyzed by alkaline agarose gel electrophoresis as described by (1) Villani ef al., 256 J 
Biol. Chem. 8202. 1981, by (2) Sabatino et a/.. 27 Biochemistry 2998. 1988. and by (3) Sambrook. et al., 
supra . To prepare the agarose gel. 250 ml of a 1.5% agarose solution in 2 mM EDTA. pH 8.0 is heated in 
a microwave oven to dissolve the agarose, and then allowed to cool to 60 *C. 8.75 ml of 1 N NaOH (final 
concentration 35 mM NaOH) is added, the gel is poured into an agarose gel electrophoresis mold. The gel 
is allowed to set for 2 hours before use. The electrophoresis buffer is 35 mM NaOH and 2 mM EDTA. The 
samples are prepared by taking the 10 ul aliquots described above ;;n 20 mM EDTA, 0.4% sodium dodecyl 
sulfate) and denatured by the addition of 1 ul of 1 N NaOH and heating at 60 *C for 10 min. 7 ul of 75 % 
glycerol. 0.2 % bromcresol green are added to each sample, and then the samples are loaded onto the 
alkaline agarose gel. Electrophoresis is carried out at 4*C at a constant current of 150 mA for 15 hours 
(until the bromcresol green has migrated about 14 cm). The electrophoresis chamber has dimensions of 26 
cm (length) x 20 cm (width) by 2 cm (height). After electrophoresis, the gel is soaked for 2 hours in 10% 
trichloracetic acid, dried under vacuum, and autoradiographed. After autoradiography, the distribution of 
radioactively labeled fragments is determined by Phosphorimager analysis (Molecular Dynamics). 

To test the processivity of a given DNA polymerase, the concentration of that polymerase in the 
reaction mixture is diluted by 2-fold increments until only a fraction (e.g. 25%) of the primers are extended, 
while the majority (e.g. 75%) remained unchanged, 40 nucleotides in length. Under these conditions, a two- 
fold increase or decrease in DNA polymerase concentration should result in about a two fold increase or 
decrease, respectively, in the fraction of primers extended. The average length of the labeled fragments that 
are extended is determined either by visual inspection of the autoradiograph or by quantitation using a 
phosphoimager. For example, using this test, the exonuclease-deficient T7 DNA polymerase complexed to 
its processivity factor thioredoxin (e.g. SEQUENASE® Version 2.0. United States Biochemical Corporation) 
has an average processivity of more than 500 nucleotides, while Klenow fragment of E. colt DNA 
polymerase I has a processivity of less than 50 nucleotides. 
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Example 5. Pet rmination of the rate of cycling of a DNA polymerase 

The rat of cycling is determined essentially as described Tabor et al (supra). The reactions ar 
carried out under the same conditions as in the extension/termination reactions used in DNA sequencing 
5 (Tabor and Richardson, supra), except that ddNTPs are omitted and the polymerase concentration is 
reduced in order to have an excess of primer-template molecules over polymerase molecules. The primer- 
template consists of the S't^PJend-labeled primer annealed to single-stranded M13 DNA molecule as 
described in Example 3. 

First a test is carried out to determine the functional ratio of primer-template molecules to polymerase 
w molecules, for example, using the large fragment of E. coli DNA polymerase I (Klenow fragment). A dilution 
experiment is carried out as described in Example 4, to determine the concentration of polymerase 
molecules necessary to extend 20% of the labeled primer-template molecules in 10 sec. This ratio of 
polymerase to primer-template is defined as being less than or equal to 1:5, and is used to determine the 
maximum rate of cycling of the polymerase as described below. 

75 Extension reactions are carried out as described in Example 4 using the ratio of DNA polymerase to 
primer-template less than or equal to 1:5, as defined in the previous paragraph. Reactions are carried out 
under conditions (I.e. buffer, pH, salt, temperature) optimal for the polymerase being tested. Aliquots are 
removed at 10 sec, 20 sec, 40 sec and 80 sec, and the reactions are terminated and the products analyzed 
as described in Example 4. For DNA polymerases with low processivity (less than 100 nucleotides), such as 

20 the large fragment of E. coli DNA polymerase i, samples are analyzed preferably by denaturing 
polyacrylamide gel electrophoresis. For DNA polymerases with high processivity (greater than 100 
nucleotides), such as T7 DNA polymerase, samples are analyzed preferably by alkaline agarose gel 
electrophoresis. After electrophoresis, the gels are dried under vacuum and analyzed either by autoradiog- 
raphy or with a Phosphoimager. 

25 When reactions are carried out with a polymerase that cycles very slowly, such as T7 DNA polymerase, 
there is not a significant decrease (I.e. less than a factor of two) in the number of unextended primers 
between 10 sec and 80 sec. Thus, if the number of unextended labeled primers does not decrease by more 
than two fold between the 10 sec and 80 sec time points, then the DNA polymerase cycles slower than 
once per 70 sec. For polymerases that cycle rapidly, the number of unextended primers will decrease by a 

30 significant fraction (i.e. by more than 2 fold) between the 10 and 80 sec time points. To determine the rate 
of cycling for these polymerases, the following equation is used: 

R = N, x L(t2)/{L(ti)x(t2-ti)} 

35 where: 

R = minimum rate of cycling in cycles per sec. 

Ni = ratio of primer-template molecules over functional DNA polymerase molecules. (Ni = 5 in the 
example above.) 

L(ti) = the maximum p.^cessivity of the DNA polymerase being investigated, in nucleotides. This is 
40 defined as the maximum number of nucleotides extended from the labeled primers under conditions of 
limiting DNA polymerase (where only 20% of the primers are extended in the 10 sec time point), as 
described in Example 3. 

Mb) = the maximum length of extension (in nucleotides) of the labeled primers at time fc, which is 80 
sec in this example. 

45 tt : the shortest time at which an aliquot is taken, or 10 sec in this example. 

fc: the longest time the reaction is allowed to precede before removing an aliquot, or 80 sec in this 
example. 

When this test is carried out for the large fragment of E. coli DNA polymerase I, a value of more than 
0.2 cycles per sec is obtained. 

so Examples 6 and 7 provide tests to determine the efficiency of incorporation of dideoxynucleotides for 
an unknown DNA polymerase using a 5* ^P-labeled 40-mer primer annealed to a single-stranded M13 DNA 
template and a gel electrophoresis-based analysis. Example 6 is best for DNA polymerases that efficiently 
incorporates dideoxynucleotides (e.g. wild-type T7 DNA polymerase and Taq DNA polymerase F667Y) 
while Example 7 is best for DNA polymerases that discriminate strongly against the incorporation of 

55 dideoxynucleotides (e.g. T7 DNA polymerase Y526F and wild-type Taq DNA polymerase). 
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Exampl 6. Gel electr ophoresis-based determination of the rat of incorporation of dideoxynucl eotid s 
r lative to deoxynucleotides using a 1:1 rati of dNTPs to ddNTPs. " 

The primary application of this test is to determine the absolute ratio of incorporation of a ddNMP to a 
dNMP for a DNA polymerase that efficiently incorporates dideoxynucleotides, such as T7 DNA polymerase, 
E. coli DNA polymerase I mutant F762Y or Taq DNA polymeras mutant F667Y. It can also indicate th ' 
level of discrimination against ddNTPs of any DNA polymerase; however, for DNA polymerases that 
discriminate strongly against ddNTPs, such as 17 DNA polymerase mutant Y526F, E. coli DNA polymerase 
I or Taq DNA polymerase, higher ratios of ddNTP to dNTP are necessary to determine precisely their level 
of discrimination, which is described in detail in Example 7. 

DNA synthesis reactions are carried out on the ^P-end labeled 40 mer-M13 mGP1-2 DNA template 
complex prepared as described in Example 3. Reaction conditions are used that are optimum for the DNA 
polymerase being tested with regard to buffer, pH, salts, and temperature of the reaction. A concentration of 
DNA polymerase is chosen whereby most of the primers are extended in a 10 min reaction and are 
terminated by the Incorporation of a dideoxynucleotide. The reaction mixture contains 100 uM 4dNTPs and 
100 uM of one of the four ddNTPs. 

We used this test to compare the ability of six DNA polymerases to incorporate each of the four 
ddNMPs. The DNA polymerases tested were (1) T7 DNA polymerase with a 28 amino acid deletion in the 
exonuclease domain and complexed in a one-to-one ratio with thioredoxin (Tabor and Richardson 264 J. 
Biol. Chem. 6447, 1989) (referred to here as "T7 DNA polymerase"), (2) The large fragment of E. coli DNA 
polymerase I, commonly called the Klenow fragment (referred to here as "E. coli DNA polymerase I"), (3) 
Unmodified DNA polymerase from Thermus aquaticus (referred to here as "Taq DNA polymerase"). (4) T7 
DNA polymerase as described above in which the tyrosine at residue 526 has been changed to a 
phenylalanine (referred to here as "T7 DNA polymerase Y526F"), (5) E. coli DNA polymerase I as 
described above in which the phenylalanine at residua 762 has been changed to a tyrosine (referred to here 
as "E. coli DNA polymerase I F762Y"), and (6) Taq DNA polymerase as described above in which the 
phenylalanine at residue 667 has been changed to a tyrosine (referred to here as "Taq DNA polymerase 
F667Y"). 

To test the relative rate of use of each of the four ddNTPs compared to the comparable dNTPs for each 
of the DNA polymerases indicated above, the reaction mixtures (8 ul) contained 1.0 ul annealed ^P-labeled 
primer-M13 DNA as described in Example 3 ("0.015 pmoles, "200.000 cpm). 40 mM Tris«HCI, pH 8.0. 5 
mM MgCI 2 , 5 mM dithiothreitol, 50 mM NaCI, 100 uM 4dNTPs, and 100 uM ddCTP. The reaction mixtures 
also contained 10 ng of yeast inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise 
increase the apparent discrimination by the DNA polymerase (Tabor and Richardson 265 / Biol. Chem. 
8322, 1990). The reactions were initiated by the addition of 2 ul of each DNA polymerase, diluted in 20 mM 
Tris»HCI, pH 7.5. 10 mM 2-mercaptoethanol and 0.05% bovine serum albumin to a concentration of 
approximately 0.025 units/ul. The concentration of each DNA polymerase was sufficient to extend most of 
the labeled primers by more than 500 nucleotides in the absence of ddNTPs in a 15 min reaction. The 
reaction mixtures were incubated for 15 min at either 37 *C <T7 DNA polymerase, T7 DNA polymerase 
Y526F, E. coli DNA polymerase I. and E. coli DNA polymerase I F762Y) or 70 • C (Taq DNA polymerase 
and Taq DNA polymerase F667Y). The reactions were terminated by the addition of 10 ul 90% formamide, 
20 mM EDTA. 0.05% bromphenol blue. Each sample was heated at 90*0 for 2 min immediately prior to 
loading 6 ul of each sample onto a gel consisting of 8% poiyacrylamide. 0.4% N,N'- 
methylenebisacrylamide, .7 M urea in 100 mM Tris* borate, pH 8.3, 1 mM EDTA. Electrophoresis was at 
2000 Volts for 90 min (until the bromphenol blue had just run off the bottom of the gel). After 
electrophoresis, the gel was dried under vacuum, and autoradiographed. After autoradiography, the 
distribution of radioactively labeled fragments was determined by Phosphorimager analysis (Molecular 
Dynamics). Alternatively, the relative intensities of dideoxy-terminated bands can be determined by 
scanning the autoradiograph using an instrument such as the SciScan 5000 imaging densitometer (United 
States Biochemical Corp). 

When the set of four reactions (each containing a single ddNTP at an equimolar concentration as the 
dNTP) was carried out with each of the six DNA polymerases described above, the reactions with three of 
the DNA polymerases (T7 DNA polymerase Y526F, E. colt DNA polymerase I and Taq DNA polymerase) 
resulted in most (>50%) of the radioactivity in the primers that had been extended migrating at the top of 
the gel, corresponding to fragments greater than 300 bases in length. Based on the predicted exponential 
decay of signal with increasing fragment size, this corresponds to discrimination by these three DNA 
polymerases of more than 100 fold against all four ddNTPs. A more precise measurement of the 
discrimination against ddNTPs by these three DNA polymerases is obtained using the test in Example 7 
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below. 

For the oth r three DNA polymerases (T7 DNA polymerase, E. coli DNA polym rase I F762Y and Taq 
DNA polymerase F667Y) the autoradiograph showed a series of dideoxy-terminated fragm nts with all of 
the reactions. In general, the average lengths of the labeled synthesized fragments were lowest for Taq 

5 DNA polymerase F667Y, with only about six radioactively labeled dideoxy-terminated fragm nts visible with 
even a several day exposure of the film. The average lengths of labeled fragments with E. coli DNA 
polymerase I F762Y are slightly longer than with Taq DNA polymerase F667Y, while the average lengths 
are significantly longer with T7 DNA polymerase. The fragments are more uniform in intensity when 
synthesized by E. coli DNA polymerase I F762Y and Taq DNA polymerase F667Y that by T7 DNA 

w polymerase. 

The distribution of radioactivity in the fragments was quantitated by Phosphoimager analysis (Molecular 
Dynamics). The total amount of labeled primers in each lane was determined by turning three control 
reactions in which no DNA polymerase was present, and the radioactivity in each of the corresponding 
radioactive bands on the gel at the position of the unextended primer was determined. With some 

75 preparations of radioactively labeled primers, a certain percentage (<10%) is not extended by any of the 
DNA polymerases, regardless of the concentration of DNA polymerase used; this background level is 
determined by measuring the percentage of radioactivity remaining at the position of unextended primer for 
a series of four reactions containing ddNTPs, and subtracting the average of these four values from the total 
number of counts determined previously. This value is defined as the total number of counts in primers that 

20 are capable of being extended by a DNA polymerase. 

The total number of counts (i.e. radioactivity) in the first three dideoxy-terminated fragments were 
determined for T7 DNA polymerase, E. coli DNA polymerase I F762Y and Taq DNA polymerase F667Y for 
each of the four ddNTP reactions. The values are presented in the Table below as the percentage of counts 
in the first three dideoxy-terminated fragments to the total number of counts in the primers capable of being 

25 extended by a DNA polymerase. 



Polymerase Reaction 




ddGTP 


ddATP 


ddTTP 


ddCTP 


T7 DNA polymerase 


67% 


66% 


76% 


61% 


E. coli DNA polymerase 1 F762Y 


95% 


92% 


96% 


92% 


Taq DNA polymerase F667Y 


97% 


95% 


95% 


99% 



35 As a further test of the efficiency of each DNA polymerase to incorporate dideoxy nucleotides, the 
number of counts in each fragment with a significant signal was determined for each reaction, and the data 
were plotted as a function of the fragment number using the Macintosh program Kaleidograph Version 3.0 
(Synergy Software). The resulting plots were fit to an exponential decay curve using the Kaleidograph 
library routine for this function. The decay curve is given by the equation: 

40 

y ~ Q M " x 

where: 

Y = 1 - (the fraction of labeled primers in fragments 1 to X compared to the total number of primers 
45 that can be extended) 

X = the fragment number (the first dideoxy-terminated fragment is 1) 

M = the exponential decay function calculated for the data by the Kaleidograph library routine. 
In the Table below, the following data are provided for each of the four ddNTP reactions using T7 DNA 
polymerase, E. coli DNA polymerase I F762Y and Taq DNA polymerase F667Y: 
so N, the number of fragments used to fit each exponential curve. 

M, the calculated exponential decay function as described above. 

D, the discrimination factor given as the ratio of the use of a specific dNTP to the use of the 
comparable ddNTP when both nucleotides are present at equal concentrations. D is calculated from the 
equation above using the calculated value of M to determine Y when X = 1, and defining D, the ratio of 
55 preference of the dNTP to the ddNTP, as Y/(1-Y>. 

R 2 , the correlation index for the data, was calculated by the Kaleidograph library routin . This is a 
measure of the variability in band intensities, or the sequence-specific variability in the ability of a DNA 
polymerase to incorporate the specific dideoxynucleotide. 
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will increase the fluorescenc of fragments of shorter length. (2) Taq DNA polym ras F667Y is used in 
place of Taq DNA polymerase. The same number of units of enzyme are used in both cases, when 
assayed under standard DNA polymerase asS ay conditions. Alternatively, the same number of DNA 
polymerase molecules can be used. (3) The reaction mixtures contain 10 ng of inorganic pyrophosphatase 
s to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA 
polymerase at specific sequences, reducing the uniformity of band intensities (Tabor and Richardson 265 J. 
Biol. Chem. 8322, 1990). Preferably this pyrophosphatase is purified from a thermophilic organism, for 
example from Thermus thermophllus (Hohne et a/. 47 Biomed. Biochim. Acta 941, 1988). All other 
aspects of the procedures areidentical.to that described in the Applied Biosy stems manual (supra). 

Example 20. Automated cycle DNA sequencing using modified Taq DNA polymerase and fluorescent dye 
terminators 

Cycle DNA sequencing with a thermostable DNA polymerase such as Taq DNA polymerase F667Y and 
is the Applied Biosystems DyeDeoxy Terminators is a modification of the procedure in the Applied 
Biosystems manual (Part Number 901497, Rev. E). The procedure is identical to that described in the 
manual with the following modifications: (1) The manual calls for the use of 1 ul of each of the 4 DyeDeoxy 
Terminators undiluted in each sequencing reaction (20 ul reaction). In this Example the Terminators should 
be diluted prior to addition to the sequencing taction mixture, since they are incorporated more than 
20 several hundred fold more efficiently with Taq DNA polymerase F667Y than with Taq DNA polymerase. 1 ul 
of each of the following dilutions are added to each sequencing reaction in place of 1 ul of the undiluted 
Terminator solutions: 

DyeDeoxy G Terminator, 1 to 500 in H 2 0 
DyeDeoxy A Terminator, 1 to 1,500 in H 2 0 
25 DyeDeoxy T Terminator, 1 to 1,500 in H 2 0 
DyeDeoxy C Terminator, 1 to 1.000 in H 2 0 

These dilutions are approximations only; the precise dilution of each DyeDeoxy Terminator should be 
determined empirically depending on the number of bases of DNA sequence to be determined from the 
primer. (2) Taq DNA polymerase F667Y is used in place of Taq DNA polymerase. The same number of 

30 units of enzyme are used in both cases, when assayed under standard DNA polymerase assay conditions. 
Alternatively, the same number of DNA polymerase molecules can be used. (3) The reaction mixtures 
contain 10 ng of inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the 
apparent discrimination by the DNA polymerase at specific sequences, reducing the uniformity of band 
intensities (Tabor and Richardson 265 J. Biol. Chem. 8322, 1990). Preferably this pyrophosphatase is 

35 purified from a thermophilic organism, for example from Thermus thermophilus (Hohne et a/. 47 Biomed. 
Biochim. Acta 941, 1988). 

Since this procedure uses over 500 times less DyeDeoxy Terminators that previous procedures, there 
is much less of a problem with unincorporated DyeDeoxy Terminators after the reactions is complete. Thus 
it is not necessary to remove the unincorporated DyeDeoxy Terminators by passing the sample over a spin 

40 column, as recommended in the Applied Biosystems manual {supra). Rather, the sample can be precipitat- 
ed with ethanol, taken up in 5 ul deionized formamide and 1 ul 50 mM EDTA, pH 8.0. heated at 90 # C for 
2 min, and loaded onto the Applied Biosystems 373A DNA Sequencing System according to the 
instructions in the 373A User's Manual. It is possible that with a DNA polymerase that incorporates the 
DyeDeoxy Terminators efficiently (such as Taq DNA polymerase F667Y) that concentration of the DNA 

45 sequencing reaction by precipitation with ethanol is not necessary; the DNA cycle sequencing reactions, 
carried out using preferably a high concentration of primer and dNTPs (see below), can be terminated by 
the addition of an equal volume of deionized formamide, heated at 90 *C for 2 min, and loaded 
immediately onto the Applied Biosystems 373 A DNA Sequencing System. This represents a major savings 
in time to the researcher preparing the samples for DNA sequence determination. 

so The procedure described above uses a relatively low concentration of dNTPs at the outset (7.5 uM 
dATP, dTTP and dCTP and 37.5 uM dITP). The concentration of dNTPs decrease during the cycle DNA 
sequencing reaction as the dNTPs are used. This concentration of dNTPs (less than 7.5 uM) is less than, 
optimal for maximum DNA polymerase activity for most DNA polymerases. This low concentration was 
necessary with previous protocols because the DNA polymerase that was used discriminated strongly 

55 against ddNTPs, requiring a high ratio of ddNTPs to dNTPs. The use of a DNA polymerase of this invention 
that discriminates much less against the DyeDeoxy Terminators now allows the use of higher concentra- 
tions of dNTPs. For example, in the protocol described above a 10-fold higher concentration of dNTPs and 
DyeDeoxy Terminators could be used; i.e.. 75 uM dATP, dTTP and dCTP, 375 uM dITP, and the following 
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dilutions of each of th four DyeDeoxy Terminators: DyeOeoxy G Terminator, 1 to 50 in H 2 0; Dy Deoxy A 
Terminator, 1 to 150 in H 2 0; DyeDeoxy T Terminator, 1 to 150 in H 2 0; DyeDeoxy C Terminator, 1 to 100 in 
H 2 0. Thus in this example the DyeDeoxy Terminator concentrations are still at least lower than that in 
previous protocols by a factor of at least 50. 

Example 21. Exonuclease assay using a pPJddAMP terminated DNA substrate 

The S'pPJddAMP-terminated DNA substrate is prepared by digesting native calf thymus DNA with 
Hindlll in a reaction mixture (50 ul) that contains 10 ug double-stranded calf thymus DNA, 40 mM Tris.HCI. 
pH 7.5, 10 mM MgCI 2 , 5 mM dithiothreitol, 50 mM NaCI, 50 ug/ml bovine serum albumin, and 10 units 
Hindlll. After incubation at 37 *C for 60 min, 5 ul of [a-^PJddATP (Amersham PB10235, >5000 Ci/mmol) 
and 5 units Sequenase Version 2.0 (United States Biochemical Corporation, catalog number 70775) are 
added, and the mixture is incubated at 20 *C for 15 min. The reaction mixture is extracted once with an 
equal volume phenol:chloroform:isoamyl alcohol (24:24:1), and fractionated on a 1 ml column of Sephadex 
G100 (Pharmacia) in 20 mM Tris«HCI, pH 7.5, 2 mM EDTA t 100 mM NaCI. The 3' ^P-labeled DNA, that 
elutes in the void volume, has a specific activity of approximately 500 cpm/ng of total DNA. 

Reaction mixtures for exonuclease assays (90 ul) contain 40 mM Tris«HCI, pH 7.5, 10 mM MgCI 2 . 10 
mM dithiothreitol, 50 mM KCI and 1 nmol 3* ^P-labeled DNA. The reaction mixtures also contain 10 ng of 
yeast inorganic pyrophosphatase to remove trace amounts of pyrophosphate and thus prevent 
pyrophosphorolysis (Tabor and Richardson 265 / Biol. Chem. 8322. 1990). This mixture is preincubated at 
20 *C for 1 min, then 10 ul of the appropriate enzyme dilution is added. After incubation at 37 *C for the 
indicated times, the reactions are stopped by the addition of 30 ul of bovine serum albumin (10 mg/ml) and 
30 ul of trichloracetic acid (100% w/v). The precipitated DNA is incubated at 0 *C for 15 min, and then 
pelleted by centrifugation at 12,000 g for 30 min. The acid-soluble radioactivity is measured in 100 ul of the 
supernatant. One unit of 3'pPJddAMP-DNA exonuclease activity catalyzes the acid solubilization of one 
pmol of pPJddAMP in 15 min. 

Other embodiments are within the following claims. 
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(B) TELEFAX: 

(C) TELEX: 

(iii) NUMBER OF SEQUENCES : 

(iv) TITLE OF INVENTION: 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE: 



Moon, Donald Keith 
Brewer & Son 
Quality House 
Quality Court 
Chancery Lane 
London 

UNITED KINGDOM 
WC2A 1HT 

(071) 242-2047 
(071) 831-0298 
28244 
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DNA POLYMERASES HAVING 
MODIFIED NUCLEOTIDE BINDING 
SITE FOR DNA SEQUENCING 



3.5" Diskette, 1.44 Mb 
storage 

IBM PS/2 Model 50Z or 55SX 
IBM P.C. DOS (Version 3.30) 
WordPerfect (Version 5.0) 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: NOT YET ASSIGNED 

(B) FILING DATE: 



(vii) PRIOR APPLICATION DATA: 

Prior applications total, 
including application 

described below: TWO 

50 

(A) APPLICATION NUMBER: US 08/324,437 
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(B) RILING DATE: 17-OCT-1994 

(A) APPLICATION NUMBER: US NOT YET ASSIGNED 

(B) RILING DATE: 10-NOV-1994 

(2) INRORMATION ROR SEQ ID NO: 1: 

( i ) SEQUENCE CHARACTERISTICS : ^ - ^ 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 1: 

Arg Arg Ser Ala Lys Ala lie Asn Phe Gly Leu lie Tyr Gly 

5 10 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 2: 

Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 3: 

Arg Asp Asn Ala Lys Thr Phe lie Tyr Gly Phe Leu Tyr Gly 

5 10 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 4: 

Arg Asp Asn Ala Lys Thr Phe lie Tyr Gly Phe Leu Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 

Arg Arg Ser Ala Lys Ala lie Asn Phe Gly Leu lie Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 6: 

Arg Arg Ser Ala Lys Thr Phe lie Tyr Gly Phe Leu Tyr Gly 

5 10 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 7: 



Arg Asp Asn Ala Lys Ala lie Asn Phe Gly Phe Leu Tyr Gly 
5 10 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 8: 

Arg Asp Asn Ala Lys Ala lie lie Tyr Gly Phe Leu Tyr Gly 
5 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 9: 

Arg Asp Asn Ala Lys Thr Phe Asn Phe Gly Phe Leu Tyr Gly 
5 io 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 10: 
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Arg Asp Asn Ala Lys Thr Phe Asn Tyr Gly Phe Leu Tyr Gly 
5 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 11: 

Arg Asp Asn Ala Lys Thr Phe He Phe Gly Phe Leu Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 12: 

Arg Arg Ser Ala Lys Ala He Asn Phe Gly Leu He Tyr Gly 
5 10 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 13: 

Arg Asp Asn Ala Lys Thr Phe He Tyr Gly Phe Leu Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 14: 

Arg Arg Ser Ala Lys Thr Phe lie Tyr Gly Leu He Tyr Gly 
5 10 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

<*> TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 15: 

Arg Arg Ser Ala Lys Thr Phe Asn Phe Gly Leu He Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 16: 

Arg Arg Ser Ala Lys Ala He He Tyr Gly Leu He Tyr Gly 
5 10 

(2) INFORMATION FOR SEQ ID NO: 17; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

< B > TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 17: 

Arg Arg Ser Ala Lys Ala He He Phe Gly Leu He Tyr Gly 
5 10 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 18: 

Arg Arg Ser Ala Lys Ala lie Asn Tyr Gly Leu lie Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 19: 

Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
5 10 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 20: 

Arg Asp Asn Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
5 10 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 21: 

Arg Arg Ala Ala Lys Thr Phe He Tyr Gly Phe Leu Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 



48 



EP 0 655 506 A1 



(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 22: 

Arg Arg Ala Ala Lys Thr lie lie Tyr Gly Val Leu Tyr Gly 

5 10 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 23: 

Arg Arg Ala Ala Lys Thr lie lie Phe Gly Val Leu Tyr Gly 

5 10 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) -TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION : SEQ ID NO: 24: 

Arg Arg Ala Ala Lys Thr lie Asn Tyr Gly Val Leu Tyr Gly 

5 10 



Claims 

1. Modified gene encoding a modified DNA polymerase wherein said modified gene is modified to 
increase the ability of said polymerase to incorporate a dideoxynucleotide relative to the corresponding 
deoxynucleotide compared to the ability of the corresponding naturally-occurring DNA polymerase by 
at least 20-fold. 

2. The modified gene of claim 1 wherein said modified DNA polymerase has sufficient DNA polymerase 
activity for use in DNA sequencing when combined with any factor necessary for said DNA polymerase 
activity. 

3. The modified gene of claim 1 or claim 2 wherein said modified DNA polymerase has sufficiently low 
exonuclease activity to allow said polymerase to be used in DNA sequencing. 

4. The modified gene of any of claims 1 to 3 wherein said modified DNA polymerase has. a dideox- 
ynucleotide binding site comprising a tyrosine residue. 
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5. Th modified g n of claim 4 wh r in said polym ras is a T7-type DNA polym ras is s lect d from 
th group consisting of T7, T3. 01, 011, H, W31 , gh-1 . Y, A1 1 22, and SP6. 

6. A modified DNA polymerase ncoded by the modified gene of any of claims 1-5. 

5 

7. The modified DNA polymerase of claim 6 wherein said polymerase is a Pol l-type polymerase modified 
at an amino acid site in a region selected from that corresponding to DNA Polymerase I of £. coli in 
regions 666-682, 710-755, 755-784, 798-867, and 914-928. 

io 8. The modified gene of any of claims 1 to 4 wherein said modified DNA polymerase is a thermostable 
enzyme. 

9. The modified gene of claim 8 wherein said thermostable enzyme is selected from the group consisting 
of the DNA polymerase encoded by Thermus aquaticus Thermus thermophitus, Thermus flavus, 

is Bacillus sterothermophilus, Thermococcus li (oralis, and Pyrococcus furiosis.. 

10. The modified gene of any of claims 1 to 5 or 8 and 9 wherein said ability of said polymerase to 
incorporate a dideoxy nucleotide relative to the corresponding deoxy nucleotide is increased at least 25- 
fold compared to the corresponding naturally-occurring DNA polymerase compared with the cor- 

20 responding deoxynucleotide. 

11. The modified gene of claim 10 wherein said ability is increased at least 50-fold. 

12. The modified gene of claim 10 wherein said ability is increased at least 100-fold. 

25 

13. The modified gene of claim 10 wherein said ability is increased at least 500-fold. 

14. Method for production of a modified DNA polymerase having an increased ability to incorporate a 
d ideoxynucleotide relative to the corresponding deoxynucleotide compared to the ability of a cor- 

30 responding naturally-occurring DNA polymerase comprising the steps of: providing a nucleic acid 
molecule encoding a DNA polymerase and mutagenizing said nucleic acid molecule to incorporate one 
or more base changes in the nucleotide base sequence at one or more sites which alter the ability of 
said polymerase encoded by said nucleic acid to incorporate a dideoxynucleotide by at least 20-fold. 

35 15. Method for determining the nucleotide base sequence of a DNA molecule comprising the steps of: 

incubating a DNA molecule annealed with a primer molecule able to hybridize to said DNA 
molecule in a vessel containing at least one deoxynucleoside triphosphate, a DNA polymerase modified 
from a naturally-occurring DNA polymerase to have an increased ability to incorporate a dideox- 
ynucleotide compared to said naturally-occurring polymerase, said polymerase having sufficient DNA 

40 polymerase activity and sufficiently low exonuclease activity to be useful for DNA sequencing, and at 
least one DNA synthesis terminating agent which terminates DNA synthesis at a specific nucleotide 
base; and 

separating the DNA products of the incubating reaction according to the size whereby at least a 
part of the nucleotide base sequence of said DNA molecule can be determined. 

45 

16. The method of claim 15 wherein said DNA polymerase is a thermostable DNA polymerase and said 
sequencing is performed at a temperature above 50 • C. 

17. The method of claim 16 wherein said DNA polymerase is a thermostable DNA polymerase and said 
so sequencing is performed at a temperature above 60 rC. 

18. The method of claim 17 wherein said thermostable enzyme is selected from the group consisting of the 
DNA polymerase encoded by Thermus aquaticus, Thermus thermophilis, Thermus flavus. Bacillus 
sterothermophilus, Thermococcus litoralis, and Pyrococcus furiosis.. 

55 

19. The method of any of claims 15 to 17 wherein said DNA polymerase distinguishes between deox- 
ynucleotides and dideoxynucleotides by less than a factor of 100. 
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20. Th m thod of any of claims 15 to 17 wh r in said DNA polymeras distinguish s betw n deox- 
ynucl otid s and dideoxynucleotides by less than a factor of 50. 

21. Th m thod of any of claims 15 to 20 wh rein said DNA polymerase has less than 1000 units of 
exonuclease activity per mg of polymerase. 

22. A kit for DNA sequencing comprising a modified DNA polymerase modified to increase the ability of 
said polymerase to incorporate a dideoxynucleotide compared to the ability of the corresponding 
naturally-occurring DNA polymerase by at least 20-fold; and a reagent necessary for said sequencing 
selected from the group consisting of dITP, a chain terminating agent. deaza-GTP and a manganese- 
containing compound. 

23. A method for sequencing a strand of DNA comprising the steps of: 

providing said strand hybridized with a primer able to hybridize to said strand, to give an 
hybridized mixture, 

incubating said hybridized mixture with one or more deoxyribonucleoside triphosphates, a modified 
DNA polymerase modified to increase the ability of said polymerase to incorporate a dideoxynucleotide 
compared to the ability of the corresponding naturally-occurring DNA polymerase, and a first chain 
terminating agent, wherein said DNA polymerase causes said primer to be elongated to form a first 
series of first DNA products differing in the length of the elongated primer, each said first DNA product 
having a said chain terminating agent at its elongated end, the number of molecules of each said first 
DNA products being approximately the same for substantially all DNA products differing in length by no 
more than 20 bases, and providing a second chain terminating agent in said hybridized mixture at a 
concentration different from said first chain terminating agent, wherein said DNA polymerase causes 
production of a second series of second DNA products differing in the length of the elongated primer, 
each said second DNA product having said second chain terminating agent at its elongated end. the 
number of molecules of each said second DNA products being approximately the same for substan- 
tially all second DNA products differing in length from each other by from 1 to 20 bases, and being 
distinctly different from the number of molecules of all said first DNA products having a length differing 
by no more than 20 bases from that of said second DNA products. 

24. Method for sequencing a nucleic acid comprising: 

(a) combining an oligonucleotide primer, a nucleic acid to be sequenced, between one and four 
deoxyribonucleoside triphosphates, a modified DNA polymerase modified to increase the ability of 
said polymerase to incorporate a dideoxynucleotide compared to the ability of the corresponding 
naturally-occurring DNA polymerase, and at least two chain terminating agents in different amounts, 
under conditions favoring extension of said oligonucleotide primer to form nucleic acid fragments 
complementary to the nucleic acid to be sequenced; separating the nucleic acid fragments by size; 
and determining nucleic acid sequence wherein said agents are differentiated from each other by 
intensity of a label .n the primer extension products. 

25. An automated DNA sequencing apparatus comprising: 

a reactor comprising reagents which provide at least two series of DNA products formed from a 
single primer and a DNA strand, wherein said reagents comprise a modified DNA polymerase modified 
to increase the ability of said polymerase to incorporate a dideoxynucleotide compared to the ability of 
the corresponding naturally-occurring DNA polymerase, each said DNA product of a said series 
differing in molecular weight and having a chain terminating agent at one end, separating means for 
separating said DNA product along one axis of the separator to form a series of bands, the intensity of 
substantially all nearby bands in a series being approximately the same, and the intensity of 
substantially all nearby bands in any one series being different from those of other series, band reading 
means for determining the position and intensity of each said band after said separating along said 
axis, and computing means that determines the DNA sequence of said DNA strand solely from said 
position and intensity of said bands along said axis and not from the wavelength of emission of light 
from any label that may be present in the separating means. 

26. A method for in vitro mutagenesis of a cloned DNA fragment comprising providing said cloned 
fragment and a modified DNA polymerase modified to increase the ability of said polymerase to 
incorporate a dideoxynucleotide compared to the ability of the corresponding naturally-occurring DNA 
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polym rase, contacting said cloned fragment with said polym rase under conditions for synth sizing a 
DNA strand from said fragm nt. wh r in said conditions caus formation of said DNA strand by 
incorporation of a plurality of individual contiguous bases abl to base-pair with said fragm nt and 
incorporation of a nucleotide base unable to base pair with said fragment. 

27. A method for in vitro mutagenesis of a template DNA fragment comprising providing a primer and said 
template, said primer having contiguous bases able to base-pair with contiguous bases of said template 
except at least one base which is unable to base-pair with said template, and extending said primer 
with a modified DNA polymerase modified to increase the ability of said polymerase to incorporate a 
dideoxynucleotide compared to the ability of the corresponding naturally-occurring DNA polymerase. 

2a A method for producing blunt-ended double-stranded DNA from a linear DNA molecule having a 5' end 
comprising a single-stranded region, wherein the 3* end of said molecule is double stranded and has 
no 3' protruding termini, comprising incubating said DNA molecule with a modified DNA polymerase 
modified to increase the ability of said polymerase to incorporate a dideoxynucleotide compared to the 
ability of the corresponding naturally-occurring DNA polymerase, wherein said polymerase acts on said 
single-stranded region to produce a blunt-ended double stranded DNA molecule. 

29. A method for labeling the 3' end of a DNA fragment comprising incubating said DNA fragment with a 
modified DNA polymerase modified to increase the ability of said polymerase to incorporate a 
dideoxynucleotide compared to the ability of the corresponding naturally-occurring DNA polymerase, 
produced from recombinant DNA, and a labelled deoxynucleotide species under conditions in which 
said 3* end of said DNA fragment is extended by said polymerase and thereby labelled by addition of 
said labelled deoxynucleotide to said DNA fragment 

30. A method of amplification of a DNA sequence comprising annealing a first and second primer to 
opposite strands of a double-stranded DNA sequence and incubating the annealed mixture with a 
modified DNA polymerase modified to increase the ability of said polymerase to incorporate a 
dideoxynucleotide compared to the ability of the corresponding naturally-occurring DNA polymerase, 
wherein said first and second primers anneal to opposite strands of said DNA sequence with their 3' 
ends directed towards each other after annealing, and with the DNA sequence to be amplified located 
between the two annealed primers. 

31. Thermus aquaticus DNA polymerase having a tyrosine at residue 667. 

32. E. coii DNA polymerase I having a tyrosine at residue 762. 

33. Pol I type DNA polymerase having a tyrosine residue at the analogous location to £. coli DNA 
polymerase residue 762. 

34. The Pol I type DNA polymerase of claim 33 wherein said location comprises the amino acid sequence 
KNi N2 N3 N* Ns Nc N7 Y G/Q, wherein each N is independently any amino acid. 

35. A DNA polymerase alpha having the sequence K H\ N 2 N 3 N* N s N 6 Y G, wherein each N is 
independently any amino acid, and wherein one said N is mutated to provide a polymerase which does 
not discriminate against ddNMP incorporation more than 50 fold more than dNMP incorporation. 

36. Nucleic acid encoding any of the DNA polymerases of claims 31 - 35. 

37. DNA polymerase which in the presence of magnesium as the only added divalent cation has an 
average processivity of less than 100 and discriminates less than 100 times against incorporation of a 
ddNMP compared to a dNMP, wherein said polymerase is not reverse transcriptase. 

38. DNA polymerase which in the presence of magnesium as the only divalent cation has an average 
processivity of less than 50 and discriminates less than 50 times against incorporation of a ddNMP 
compared to a dNMP. 
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39. ONA polym ras which in the presenc of magnesium as th only divalent cation has an av rag 
processivity of less than 50 and discriminates I ss than 5 times against incorporation of a ddNMP 
compared to a dNMP. 

5 40. Thermophilic DNA polym rase that discriminates against a ddNMP compared to a dNMP by I ss than 
a factor of 100. 

41. Thermophilic DNA polymerase that discriminates in the presence of magnesium as the only divalent 
cation against a ddNMP compared to a dNMP by less than a factor of 100. 

10 

42. The DNA polymerase of claim 41 wherein said polymerase has an average processivity less than 100. 

43. The polymerase of claim 42 wherein said polymerase cycles from one primer-template to another more 
than once per five seconds. 

44. Nucleic acid encoding the polymerase of any of claims 37 - 43. 

45. Method for cycle sequencing using a DNA polymerase of any one of claims 31-35 and 37 - 43. 

20 46. Cellular DNA polymerase having a tyrosine in place of the naturally occurring amino acid at a location 
which causes said polymerase not to discriminate against a ddNMP compared to a dNMP more than 
50 fold. 

47. The DNA polymerase alpha of claim 35 mutated at any of Ni to N 6 to give a polymerase that 
25 incorporates ddNMPs at least 20 times more efficiently than the non-mutated polymerase. 

48. Method for cycle sequencing, comprising the step of providing an excess or equal amount of all four 
dNTPs than each of the four ddNTPs in a cycle sequencing reaction and performing said cycle 
sequencing reaction. 

30 

49. Method for cycle sequencing comprising the step of providing less than 10 times the amount of all four 
fluorescently labelled dideoxynucleotide than the corresponding deoxynucleotides in a cycle sequen- 
cing reaction and performing said cycle sequencing reaction. 

35 50. Modified gene encoding a modified DNA polymerase wherein said modified gene is modified to 
increase the ability of said polymerase to incorporate a dideoxynucleotide or other deoxynucleotide 
analog relative to the corresponding deoxynucleotide compared to the ability of the corresponding 
naturally-occurring DNA polymerase. 

40 51. A modified DNA polymerase encoded by the modified gene of claim 50. 

52. A modified DNA polymerase wherein said modified DNA polymerase is modified to increase the ability 
of said polymerase to incorporate a dideoxynucleotide or other deoxynucleotide analog relative to the 
corresponding deoxynucleotide compared to the ability of the corresponding naturally-occurring Df... 
45 polymerase. 
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Polymerase 


ddNTP 


N 


M 


D 


R 2 


T7 ONA polymerase 


ddGTP 


8 


-0.375 


2.2 


0.813 




ddATP 


6 


-0.356 


2.3 


0.996 




ddTTP 


5 


-0.450 


1.8 


0.997 




UUv 1 r 


Q 
O 


"U.O 1 / 


O 7 


A OOQ 
U.OOO 


E. coli DNA polymerase 1 F762Y 


ddGTP 


5 


-1.03 


0.56 


0.999 




ddATP 


5 


-0.860 


0.72 


0.998 




ddTTP 


5 


-1.06 


0.54 


1.000 




ddCTP 


6 


-0.842 


0.75 


1.000 


Taq DNA polymerase F667Y 


ddGTP 


5 


-1.18 


0.45 


0.995 




ddATP 


6 


-0.997 


0.59 


0.997 




ddTTP 


6 


-1.01 


0.56 


0.996 




ddCTP 


4 


-1.44 


0.32 


0.996 


Averages: 












T7 DNA polymerase 


4 ddNTP 






2.3 


.924 


E. coli DNA polymerase 1 F762Y 


4 ddNTP 






0.64 


.999 


Taq DNA polymerase F667Y 


4 ddNTP 






0.48 


.996 



In summary, T7 DNA polymerase discriminates an average of 2.3 fold against ddNTPs, while E. coli 
DNA polymerase I F762Y and Taq DNA polymerase F667Y actually prefer ddNTPs over dNTPs an average 
of 1.6 fold (1/0.64) and 2.1 fold (1/0.48), respectively. A comparison of R 2 indicates that the intensity of 
neighboring fragments are more uniform with E. coli DNA polymerase I F762Y and Taq DNA polymerase 
F667Y than with T7 DNA polymerase. For a more accurate measure of uniformity, a greater number of 
fragments could be included in the analysis by reducing the level of ddNTPs (for example by 5 fold) in 
each reaction, reducing the decay in intensity at each position (see Example 13). 

To determine the amount of discrimination against ddNTPs by a new DNA polymerase, reactions 
analogous to those described above would be carried out, and identical reactions would be carried out in 
parallel using T7 DNA polymerase (SEQUENASE Version 2.0, United States Biochemical Corporation), with 
all reactions analyzed on the same gel. An initial comparison of the distribution of dideoxy-terminated bands 
obtained with the new DNA polymerase compared with those obtained with T7 DNA polymerase would 
indicate whether the new DNA polymerase discriminated more or less against ddNTPs than T7 DNA 
polymerase. For example, such a visual inspection using E. coli DNA polymerase I F762Y clearly shows 
that for reactions with each of the 4 ddNTPs, the number of fragments visible on the gel in reactions using 
E. coli DNA polymerase I F762Y are less (and smaller in average size) than those using 17 DNA 
polymerase. A more quantitative analysis could then be carried out analogous to that described above in 
order to calculate the exponential decay factor (M), average relative rate of utilization of dNTPs relative to 
dNTPs (D) and variability in intensity (R 2 ) for the new DNA polymerase as described above. 

One complication that can occur in this test is when the DNA polymerase has an associated 
exonuclease activii/. such as the 5' to 3' exonuclease activity associated with Taq DNA polymerase and the 
3' to 5' exonuclease activity associated with E. coli DNA polymerase I and native T7 DNA polymerase (not 
the _28 T7 DNA polymerase deletion mutant used in the experiments above). A 5' to 3' exonuclease 
activity is detrimental since it can remove the label on the 5 1 end of the primer, reducing the radioactivity 
signal being detected. This problem can be partially avoided by reducing the amount of DNA polymerase in 
the reaction mixture. In the example above. 0.025 units of Taq DNA polymerase resulted in virtually all of 
the primers being extended until terminated by incorporation of a dideoxy nucleotide, without appreciable 
loss in radioactivity due to 5' to 3' exonuclease activity, whereas a 40 fold increase in Taq DNA polymerase 
activity, or 1 unit per reaction, resulted in the loss of virtually all 32 P from the 5' ends of the primer. An 
alternative approach to measuring the extent of discrimination for a DNA polymerase with a 5* to 3' 
exonuclease activity is to use a different assay such as those described in Examples 8-10. 

A 3* to 5' exonuclease activity can complicate the assay described above by making the DNA 
polymerase appear to discriminate more against a ddNTP than it actually does (for example, see Tabor and 
Richardson, 86 Proc. Natl. Acad. $ct\ 4076, 1987). This is because once a dideoxynucleotide has been 
incorporated, the exonuclease activity can preferentially remove the dideoxynucleotide so that DNA 
synthesis can continue, resulting in an increase in the length of the fragment. Preferably, the enzymes 
assayed in the test described above are devoid of such 3' to 5' exonuclease activity; examples are modified 
T7 DNA polymerase (Sequenase®, United States Biochemical Corporation), Taq ONA polymerase, ex- 
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onuclease-deficient Vent ( Thermococcus titoraiis) DNA polymerase (New England Biolabs catalog number 
257), exonuclease-deficient Deep Vent (Pyrococcus GB-1) DNA polymerase (New England Bioiabs catalog 
number 259 ). exonuclease-deficient Pfu {Pyrococcus furtosus) DNA polymerase (Stratagerte catalog 
number 600163), and exonuclease-deficient Klenow fragment (E. coli DNA polymerase I, United States 
Biochemical Corporation, catalog number 70057). In some instances, such as with E. coli DNA polymerase I 
(Klenow fragment) the 3' to 5* exonuclease activity is weak and does not interfere significantly with this 
assay (see for example Tabor and Richardson 264 J. Blot. Chem. 6447, 1989). One method to determine 
whether a new DNA polymerase being tested has a 3* to 5' exonuclease activity that is interfering with the 
ability to accurately measure the discrimination against ddNTPs is to carry out the experiment described 
above, removing aliquots at different time points up to 60 min. If the size distribution of the dideoxy- 
terminated fragments increases with time, then it is likely that such a 3' to 5' exonuclease activity is 
interfering with the assay, while if the distribution of fragments is constant over time then such an activity is 
not having a significant effect, tf the average fragment length is increasing with time, then one should use a 
shorter incubation time and/or decrease the DNA polymerase concentration to a range in which the 
fragment sizes remain constant with time. 

Pyrophosphorolysis, or the reversal of the polymerase reaction, can have a similar effect as the 3' to 5* 
exonuclease activity, allowing the DNA polymerase to remove the chain terminating dideoxynucteotide and 
further increase the length of the fragments (see Tabor and Richardson, 265 J. Blot. Chem. 8322, 1990). 
This activity is readily avoided by including pyrophosphatase in the reaction mixture, in order to remove the 
pyrophosphate that accumulates during DNA synthesis and is a necessary substrate for 
pyrophosphorolysis. 

Example 7. Gel electrophoresis-based determination of the rate of incorporation of dideoxynucleotides 
relative to deoxynucleotides by varying the ratio of dNTPs to ddNTPs. 

This example is similar to that described in Example 6. While it is the preferred test for DNA 
polymerases that discriminate strongly against the incorporation of dideoxynucleotides (e.g. T7 DNA 
polymerase Y526F, E. coli DNA polymerase I and Taq DNA polymerase), it also works well with DNA 
polymerases that efficiently incorporate ddNMPs (e.g. T7 DNA polymerase, E, coli DNA polymerase I 
mutant F762Y and Taq DNA polymerase mutant F667Y). 

In this test, the ratio of ddNTP to dNTP is varied for two different DNA polymerase preparations, 
keeping all other aspects of the reactions identical, and the distributions of dideoxy-terminated radioactively 
labeled fragments are compared to determine the ratios required for the two DNA polymerases being tested 
to obtain fragments of comparable average length. 

The average length of a series of fragments is determined in one of two ways. In the first, which is best 
for DNA polymerases that incorporate ddNMPs efficiently, one inspects the autoradiograph and determines 
the position of the largest fragments visible on a given exposure for a series of reactions containing 
ddNTP:dNTP ratios that vary by two-fold increments using one DNA polymerase, and compares that to an 
analogous series using the second DNA polymerase, to determine the ratios required to generate fragments 
of comparable size for the two DNA polymerases. The position of the front marking the appearance of 
visible radioactive bands is usually relatively sharp and readily observed by eye. However, it is also 
possible to determine such positions more precisely using the Phosphotmager to locate the position in each 
lane where a certain threshold of radioactivity per unit «.,ea occurs, starting at the top of the gel and moving 
down the gel. 

Some DNA polymerases discriminate very strongly against the incorporation of dideoxynucleotides, in 
which case it is difficult to add sufficient ddNTPs to the reaction to clearly detect the position of the largest 
dideoxy-terminated fragments on a denaturing polyacrylamide gel. For such DNA polymerases, one can 
use an alkaline agarose gel electrophoresis to compare the lengths of the dideoxy-terminated fragments in 
the different series. If one uses a denaturing polyacrylamide gel. then an alternative method to determine 
the ratios of ddNTP:dNTP required for the two DNA polymerases to generate dideoxy-terminated fragments 
of comparable average lengths is to focus on one or several bands and determine the ratio of ddNTPs to 
dNTPs required to obtain a specific level of radioactivity in those fragments, as analyzed by the 
Phosphoimager, for the two DNA polymerases being tested. 

These tests were carried out using the six DNA polymerases described in Example 6. The reaction 
conditions were identical to that described in Example 6 except for the concentrations of dNTPs and 
ddNTPs. All reaction mixtures contained 10 uM 4 dNTPs. Each of the four ddNTP concentrations were 
varied by two-fold increments in the following ranges for the six DNA polymerases as follows: T7 DNA 
polymerase. E. colt DNA polymerase I F762Y, and Taq DNA polymerase F667Y , 0.02 uM to 1 uM, and T7 
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DNA polymerase Y526F. E. coli DNA polymerase I and Taq DNA polymerase F667Y, 100 to 2,000 uM. The 
reactions wer carried out and the samples were analyzed by denaturing polyacrylamide gel elec- 
trophoresis as described in Example 6. Drying of the gel, autoradiography, and Phosphoimager analysis 
were as described in Example 6. The Table helow summarizes the results from this experiment; the values 
shown for T7 DNA polymerase, E. coli DNA polymerase I F762Y, and Taq DNA polymerase F667Y are the 
absolute ratios obtained in Example 6 by statistical analysis of the rate of exponential decay in intensity of 
dideoxy-terminated fragments obtained using a 1:1 ratio of dNTPs to ddNTPs. The values obtained for T7 
DNA polymerase Y526F, E. coli DNA polymerase I and Taq DNA polymerase were obtained by determining 
the ratios of ddNTP to dNTP required to generate a series of dideoxy-terminated fragments of comparable 
average length to a series generated using T7 DNA polymerase, E. coli DNA polymerase I F762Y, and Taq 
DNA polymerase F687Y. respectively; i.e., for each pair of wild-type and mutant DNA polymerases the 
ratios of ddNTPs:dNTPs were determined that give a comparable distribution of dideoxy-terminated 
fragments. The ddNTP:dNTP ratio used in the reaction with the strongly discriminating enzyme (I.e. the one 
that contains phenylalanine at the critical position) divided by the ddNTP:dNTP ratio used to obtain a 
comparable distribution of dideoxy-terminated fragments with the relatively non-discriminating enzyme (i ©- 
the one that contains tyrosine at the critical position) gives a factor that corresponds to the difference in 
efficiency between the two DNA polymerases in their use of ddNTPs relative to the comparable dNTP. This 
factor was multiplied by the absolute ratios obtained for 17 DNA polymerase, E. coli DNA polymerase I 
F762Y, and Taq DNA polymerase F667Y in Example 6 in order to obtain the values shown below for T7 
DNA polymerase Y526F, E. coli DNA polymerase I and Taq DNA polymerase, respectively. 



Polymerase Incorporation Rate Ratios 




dG/ddG 


dA/ddA 


dT/ddT 


dC/ddC 


T7 DNA polymerase 


3.2 


3.3 


2.8 


3.7 


T7 DNA polymerase Y526F 


6,400 


7,300 


8.400 


11.000 


E. coli DNA polymerase 1 


140 


720 


1.100 


250 


E. coli DNA polymerase 1 F762Y 


0.56 


0.72 . 


o;54 


0.75 


Taq DNA polymerase 


1.400 


4.700 


4,500 


2,600 


Taq DNA polymerase F667Y 


0.45 


0.59 


0.56 


0.32 



The Table below summarizes the effect of having tyrosine in place of phenylalanine at the critical 
selectivity residue of T7 DNA polymerase, E. coli DNA polymerase I and Taq DNA polymerase on the use 
of ddNTPs relative to dNTPs. 



Residue 


Average Rate 


Improvement in dN/ddN 


use of ddNTPs 


T7 DNA polymerase 


Tyrosine (WT) 


3.0 


3,000 X 




Phenylalanine 


8.000 




E. coli DNA polymerase 1 


Phenylalanine (WT) 


600 






Tyrosine 


0.6 


1000 X 


Taq DNA polymerase 


Phenylalanine (WT) 


3.000 






Tyrosine 


0.5 


6,000 X 



To use this test to determine the extent of discrimination of a new DNA polymerase, reactions would be 
carried out as described above initially using a wide range of ratios of ddNTPs to dNTPs, and comparing 
the distribution of dideoxy-terminated fragments on a denaturing polyacrylamide gel to those of a standard, 
e.g. T7 DNA polymerase. Matching the lanes that have comparable average lengths of DNA fragments, the 
ratio of ddNTPs:dNTPs of the new DNA polymerase is divided by the ratio used with T7 DNA polymerase 
to give the level of discrimination against ddNTPs by the new DNA polymerase relative to T7 DNA 
polymerase. 

To use this test to determine whether the modification of a DNA polymerase results in a decrease in its 
ability to discriminate against ddNTPs (i.e. incorporate dideoxynucleotides more efficiently), an identical 
number of units of modified, and unmodified DNA polymerases would be used in a series of reactions 
containing varying ratios of ddNTPs to dNTPs as described above. The average length of dideoxy- 
terminated fragments are compared for identical ratios of ddNTPs to dNTPs for the two enzymes. If the 
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modification has resulted in a ONA polymerase that incorporates dideoxynucteotides more efficiently, the 
average length of dideoxy-terminated fragments wilt be shorter for reactions using the modified ONA 
polymerase compared with those using the unmodified ONA polymerase at the same ratios of ddNTP to 
dNTP, while the average length will be longer for reactions using the modified ONA polymerase if the 

s modification resulted in a ONA polymerase that is more discriminatory towards ddNTPs. 

This test can also be used to determine whether a modification of a ONA polymerase results in a 
decrease in its ability to discriminates against analogs of ddNTPs, for example fluorescently tagged 
ddNTPs. This is possible even if one does not know the concentration of the analogs being tested. As an 
example of this, we compared the ability of Taq ONA polymerase and Taq ONA polymerase F667Y to use 

w each of the four OyeOeoxy Terminators manufactured by Applied Biosystems (part number 401 150). These 
OyeDeoxy Terminators have four different fluorescent moieties covatentty bound to each of the four 
ddNTPs (see Example 12 for more detail). For each of the OyeOeoxy Terminators, the ratio of dNTPs to 
OyeDeoxy Terminators was varied over a 16,000 fold range by intervals of two-fold, and the pattern of 
dideoxy-terminated fragments was compared on the autoradiograph to determine the ratios required for 

i$ each of the two enzymes to obtain the same average length of dideoxy-terminated fragments. The Table 
below summarizes these results. For each Terminator, the column labeled "Ratio" represents the ratio of 
the ratios of ddNTP to dNTP required to give fragments of identical average length for Taq DNA 
- polymerase versus Taq ONA polymerase F667Y. As with normal ddNTPs, Taq ONA polymerase F667Y 
incorporates the fluorescent ddNTP derivatives much more efficiently that does the unmodified Taq ONA 

20 polymerase, by at least a factor of 400. 



DyeDeoxy Terminator 


Ratio 


G Terminator 
A Terminator 
T Terminator 
C Terminator 


>400 
> 2.000 
>2.000 
>2.000 



As discussed previously, one complication that can arise in the use of this test is when the DNA 
polymerase being tested has an associated exonuclease activity. The problems that 5* to 3* and 3* to 5' 
exonucleases can cause and ways to minimize their effects when present are discussed in Example 6. 
When testing to determine whether the modification of a polymerase decreases its ability to incorporate 
dideoxynucleotides, one class of mutants that can have this effect are ones that inactivate a normally very 
active 3' to 5' exonuclease activity (see for example Tabor and Richardson 84, Proc. Natl. Acad. Sci. 4767, 
1987). This class of mutants are not claimed in this patent. If one has a modified DNA polymerase that 
gives an apparent increase in the ability of the DNA polymerase to incorporate dideoxynucleotides, and one 
wants to determine whether it is in the polymerase domain or the exonuclease domain, it is necessary to 
carry out an exonuclease assay on the modified and unmodified forms of the enzyme; a mutation that 
affects primarily the exonuclease activity of the enzyme will have a greater effect on the exonuclease 
activity of the enzyme than on the polymerase activity. Preferably, one would measure the exonuclease 
activity on a DNA substrate label' - '* at its 3' end with M P-ddAMP (see Example 21). As in Example 6, it is 
important to inhibit. pyrophosphorolysis in these reactions in order to avoid it increasing the apparent 
discrimination against ddNTPs by a DNA polymerase. This is readily accomplished by including 
pyrophosphatase in the reaction. 

Example 8. Determ in ation of the efficiency of incorporation of dideoxynucleotides by inhibition of DNA 
synthesis on a singlw-stranded M13 DNA - unlabeled 40-mer primer complex 

In this example the sensitivity of a DNA polymerase to a ddNTP is determined by measuring the ability 
of varying concentrations of the ddNTP to inhibit a standard DNA synthesis reaction. The DNA synthesis 
assay is a modification of that described in Tabor and Richardson 264 J, Biol. Chem. 6447, 1989. The 40- 
mer primer and the Ml 3 mGP1-2 template are as described in Example 3. The primer is annealed to the 
M13 mGP1-2 single-stranded DNA template in a reaction mixture (1X = 25 ul) containing 2 ug of M13 
mGPt-2 DNA, 6 pmoles of primer (a 10-fold molar excess to template), 40 mM Tris-HCi, pH 8.0, 10 mM 
MgCI 2 and 100 mM NaCI. The mixture is incubated 65 *C for 2 min and then cooled to room temperature 
over 30 min. The standard reaction mixture (45 ul) contained 22 mM Tris«HCI, pH 8.0, 5.5 mM MgCb, 55 
mM NaCI, 300 uM dGTP, dATP, dCTP and [ 3 H]TTP (30 cpm/pmol), and varying concentrations of one of 
the four or all four ddNTPs. the reaction mixtures also contained 10 ng of yeast inorganic pyrophosphatase 
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to inhibit pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA 
polymerase (Tabor and Richardson 265 J. Biol. Cheat. 8322, 1990). The mixtures are incubated at 37 -C for 
1 min (70 *C for thermophilic DNA polymerases), and the reactions are initiated by the addition of 5 ul 
aliquots of dilutions (0.01 to 1 unit) of the DNA polymerase being analyzed, diluted in 20 mM Tris*HCI. pH 

5 7.5, 10 mM 2-mercaptoethanol and 0.05% bovine serum albumin. The reaction mixtures are further 
incubated at 37* C for 10 min (70 *C for thermophilic DNA polymerases). The reactions are terminated by 
the addition of 5 ul of 100 mM EDTA, and 45 ul is spotted onto Whatman DE81 filter discs. The discs are 
washed in 4 changes of 1 50 ml of 0.3 M ammonium formate, p H 8.0. followed by 2 changes of 1 50 ml of 
90% ethanol, each of 5-10 min duration. The disks are then dried under a heat lamp and counted in a 

10 scintillation counter in the presence of 5 ml of fluor (Opti-Fluor 0. Packard). From the amount of 
radioactivity on each disk, the amount of total DNA synthesis is calculated. 

Specific DNA polymerases being tested may have optimum buffer, pH, salt or temperature conditions 
that differ from those suggested above. Each DNA polymerase should be tested under the conditions that 
give optimum specific polymerase activity for that enzyme. 

/s To determine whether a modification of a given DNA polymerase results in a decrease in its ability to 
discriminate against dideoxy nucleotides, first a series of reactions are carried out in the absence of 
ddNTPs. varying the DNA polymerase concentration to determine the range where the activity varies 
approximately linearly with enzyme concentration for both the modified and unmodified forms of the 
enzyme. An enzyme concentration is chosen that is in this linear range for both forms of the enzyme; e.g., 
20 an enzyme concentration whereby about of the template is replicated in the 10 min reaction is likely to 
be in such a linear range. 

Once a proper enzyme concentration is chosen, a series of reactions are carried out varying the amount 
of either one ddNTP or preferably all four ddNTPs in the mixture, in order to determine the concentration 
required to inhibit 50% of the DNA synthesis. For example, under the conditions stated above (300 uM 
25 4dNTPs), the following concentrations of a mixture of 4 ddNTPs are required to inhibit 50% of the DNA 
synthesis for the following six DNA polymerases: 



Polymerase 


[4ddNTP] for 50% inhibition 


77 DNA polymerase 


0.1 uM 


T7 DNA polymerase Y526F 


300 uM 


E. coli DNA polymerase I 


20 UM 


E. coli DNA polymerase I F762Y 


0.04 uM 


Taq DNA polymerase 


150UM 


Taq DNA polymerase F667Y 


0.4 uM 



This test can be used to determine if a modification in a new DNA polymerase results in a decrease in 
its ability to discriminate against ddNTPs; if the mutation does have this e fleet, then a higher concentration 
of 4ddNTPs will be required to inhibit 50% of the DNA synthesis in the assay described above. 

Example 9. Determination of the efficiency of incorp;.ation of dideoxynucleotides by measuring the 
incorporation of [a-^PJdAMP into synthetic primer-template complexes 

In this example the competition between a dNTP and a ddNTP is assayed for use at a single site in a 
synthetic primer-template. This assay differs from the others in that it limits the comparison of the use of 
the two substrates to a single site, avoiding the complication of sequence-specific variability in discrimina- 
tion. While this relatively simple assay is suitable for a preliminary screen of DNA polymerases for their 
ability to discriminate against ddNTPs. it should not be used to the exclusion of the assays presented in 
Examples 6-8 since often the discrimination against ddNTPs is strongly influenced by the neighboring 
sequences, an important problem for DNA sequence analysis (see for example Tabor and Richardson 265 
J. Biol. Chem. 8322. 1990). 

The two primer-templates shown below are used in this example. The first is used to determine the 
discrimination between dATP versus ddATP, while the second is used to determine the discrimination 
between dCTP versus ddCTP. dTTP versus ddTTP, and dGTP versus ddGTP. 
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Primer-Template A: 

5' GGCGACGTTGTAAAACGACGGCCAGTGCCA 3' 

s 

3 ' GCTGCAACATTTTGCTGCCGGTCACGGTTCCCC 5 ' 



10 



Primer-Template B: 

5' GGCGACGTTGTAAAACGACGGCCAGTGCCA 3' 
3' GCTGCAACATTTTGCTGCCGGTCACGGTCAGTTTT 5' 

20 

Each reaction mixture contains 25 pmoles each of primer and template. The primer and template are 
mixed together and annealed in a reaction mixture (1X = 10 ul) containing 40 mM Tris*HCI, pH 8.0, 10 
mM MgCl2 and 100 mM NaCI. The mixture is incubated 65 "C for 2 min and then cooled to room 

25 temperature over 30 min. The standard reaction mixture (45 ul) for reactions carried out with Primer- 
Template A contains 22 mM Tris*HCI, pH 8.0, 5.5 mM MgCl2, 55 mM NaCI, 25 pmoles of the Primer- 
Template A complex, 5 uM [a-^PJdGTP (4,000 cpm/pmole) and varying concentrations of dATP and 
ddATP. The reaction mixtures also contained 10 ng of yeast inorganic pyrophosphatase to inhibit 
pyrophosphorolysis that could otherwise increase the apparent discrimination by the DNA polymerase 

30 (Tabor and Richardson 265 J. Biol. Chem, 8322, 1990). The mixtures are incubated at 37 *C for 1 min 
(70 "C for thermophilic DNA polymerases), and the reactions are initiated by the addition of 5 ul aJiquots 
(0.01 to 1 unit) of the DNA polymerase being analyzed diluted in 20 mM Tris*HCI, pH 7.5, 10 mM 2- 
mercaptoethanol and 0.05% bovine serum albumin. The reaction mixtures are further incubated at 37*C for 
10 min (70 *C for thermophilic DNA polymerases). The reactions are terminated by the addition of 5 ul of 

as 100 mM EDTA, and 45 ul is spotted onto Whatman DE81 filter discs. The discs are washed in 4 changes of 
150 ml of 0.3 M ammonium formate, p H 8.0, followed by 2 changes of 150 ml of 90% ethanol, each of 5- 
10 min duration. The disks are then dried under a heat lamp and counted in a scintillation counter in 5 ml of 
fluor (Opti-Fluor O, Packard). From the amount of radioactivity on each disk, the amount [ 3P P)dGMP 
incorporated was determined. The assumption is made once a single dAMP residue has been incorporated 

40 to remove the block for the incorporation of dGMP residues, four [^PJdGMPs will be incorporated into each 
primer, and thus the number of dAMPs incorporated are one fourth the number of dGMPs incorporated. 

Ail reactions are carried out with a constant amount of the DNA polymerase being analyzed; the amount 
of DNA polymerase should be sufficient to replicate 50% of the total dCMP residues in the single-stranded 
region of the template in the 10 min incubation in the presence of 10 uM dATP and the absence of ddATP. 

45 Specific DNA polymerases being tested may have optimum buffer, pH, salt, or temperature conditions that 
differ from those suggested above. Each DNA polymerase should be tested under the conditions that give 
optimum specific polymerase activity for that enzyme. Control reactions should also be carried out in which 
neither dATP nor ddATP are present; this defines the background DNA synthesis that should be subtracted 
from each sample. In general this is less than 10% of the DNA synthesis obtained when dATP is present. 

so Reactions are then carried out with 10 uM dATP and varying concentrations of ddATP, to determine the 
amount of ddATP required to inhibit DNA synthesis by 50%. Examples are shown in the Table below for 
the concentration of ddATP required to inhibit 50% of the DNA synthesis in the presence of 10 uM dATP. 
The polymerases are defined as in Example 6. 

55 
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Polymerase 


[ddATP] for 50% inhibition 


T7 DNA polymerase 


-30 uM 


T7 DNA polymerase Y526F 


>500uM 


E. coli DNA polymerase I 


>500 uM 


E. coli DNA polymerase I F762Y 


-6uM 


Taq DNA polymerase 


>500uM 


Taq DNA polymerase F667Y 


-5 uM 



In order to carry out an analogous test measuring the discrimination against ddGTP, ddTTP or ddCTP, 
reactions are carried out identical to that described above except that Primer-Template B is used instead of 
Primer-Template A, and the reactions containing 10 uM dGTP, dTTP and dCTP and 5 uM [a-^PJdATP 
(4,000 cpm/pmole) and varying concentrations of either ddGTP, ddTTP or ddCTP. 

As with the other examples. DNA polymerases with a 3' to 5' exonuclease activity can interfere with this 
assay, making an enzyme more discriminatory against ddNTPs than that due to discrimination at the level 
of incorporation of the analog. In addition, enzymes with high levels of exonuclease activity can use up all 
the dNTPs in the reaction mixture (especially with the relatively low level of dNTPs present in these 
reactions), resulting in no net DNA synthesis (e.g. the native T7 DNA polymerase, see Tabor and 
Richardson 264 */. Biol. Chem. 6447. 1989). In these cases the concentration of DNA polymerase and the 
incubation time of the reaction should be adjusted to obtain the maximum level of DNA synthesis in the 
absence of ddNTPs. 

Example 10. Determination of the efficiency of incorporation fa-^PjddNMPs into a synthetic primer-template 
complex — 

In this example the competition between a dNMP and a ddNMP is assayed for incorporation at a single 
site in a synthetic primer-template. This assay differs from that in Example 9 in that the label is in [a-^P] 
ddATP. and thus the incorporation of ddAMP is being measured. This assay can be used to test whether a 
ddNTP is inhibiting a DNA polymerase by acting as a chain terminator, being incorporated into the 3' end of 
the primer, or simply by binding the DNA polymerase and preventing further DNA synthesis without actually 
being incorporated into the primer. 

In the example below the incorporation of ddAMP is measured using [a- 32 P]ddATP and Primer- 
Template A (Example 9): 

Primer-Template A: 



5' GGCGACGTTGTAAAACGACGGCCAGTGCCA 3' 
3 ' GCTGCAACATTTTGCTGCCGGTCACGGTTCCCC 5 ' 



Incorporation of [a-^PJddGMP, [a-^PJddCMP and [a- M P]ddTMP could be similarly tested on the 
appropriate template (for example. Primer-Template B in Example 9). 

Each reaction mixture contains 25 pmoles each of primer and template (Primer-Template A, see 
above). The primer and template are mixed together and annealed in a reaction mixture (IX = 10 ul) 
containing 40 mM Tris*HCI, pH 8.0. 10 mM MgCI 2 and 100 mM NaCI. The mixture is incubated 65 *C for 
2 min and then cooled to room temperature over 30 min. The standard reaction mixture (45 ul) contains 22 
mM Tris-HCI. pH 8.0. 5.5 mM MgCfe, 55 mM NaCI, 25 pmoles of the Primer-Template A complex, 2.5 uM 
[a-^PJddATP (Amersham PB10235. >5000 Ci/mmol diluted with cold ddATP to a specific activity of 4,000 
cpm/pmole) and varying concentrations of dATP. The reaction mixtures also contained 10 ng of yeast 
inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent dis- 
crimination by the DNA polymerase (Tabor and Richardson 265 J. Biol. Chem. 8322. 1990). The mixtures 
are incubated at 37 *C for 1 min (70 'C for thermophilic DNA polymerases), and the reactions are initiated 
by the addition of 5 ul aliquots (0.01 to 1 unit) of the DNA polymerase being analyzed diluted in 20 mM 
Tris«HCI, pH 7.5, 10 mM 2-mercaptoethanol and 0.05% bovine serum albumin. The reaction mixtures are 
further incubated at 37 "C for 10 min (70 *C for thermophilic DNA polymerases). The reactions are 
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terminated by the addition of 5 ul of 100 mM EDTA. and 45 ul is spotted onto Whatman DE81 filter discs 
The discs are washed in 4 changes of 150 ml of 0.3 M ammonium formate, p H 8.0 followed by 2 chanoTs 
of 150 ml of 90% ethanol. each of 5-10 min duration. The disks are then dried under a heat lamo and 
counted in a scintillation counter 5 ml of fluor (Opti-Fluor O. Packard). From the amount of radioactivity on 

i each disk, the amount pPJddAMP incorporated is determined. 

nw . A " reactions ar * carried out with a constant amount of the DNA polymerase being tested; the amount of 
polymerase should be that concentration which gives the highest level of incorporation of FPJddAMP 
into Primer-Template A in the 10 min incubation in the absence of dATP. Specific DNA polymerases being 
tested may have optimum buffer, pH t salt or temperature conditions that differ from those suggested 

> above. Each DNA polymerase should be tested under the conditions that give optimum specific polymerase 
activity for that enzyme. 

In order to use this assay to determine the level of discrimination against a ddNTP reactions are 
earned out w.th a constant amount of the DNA polymerase and [»P]ddATP, in the presence or absence of 
2.5 uM dATP (an equimolar concentration to pPJddATP) and the effect the presence of dATP has on the 
incorporation of [*P]ddAMP is determined. If a DNA polymerase does not discriminate between the 
incorporation of ddAMP and dAMP, and it has no 3* to 5' exonuclease activity, then the addition of dATP 
will inhibit incorporation of [^PJddAMP by 50%. 

This test is best for a DNA polymerase that efficiently incorporates ddNMPs, such as Taq DNA 
polymerase F667Y. For DNA polymerases that discriminate strongly against ddNMPs. the previous assay is 
preferred, .n which the label is in a dNTP other than the one being used in competition with the ddNTP 
since in that case much higher concentrations of the ddNTP can be used. 

However, with DNA polymerases that discriminate strongly against ddNMPs. if one is interested in 
testing whether a given mutation is reducing the level of discrimination against ddNMPs. this assay could 
be used by assaying the unmodified DNA polymerase on this substrate in the absence of dATP (measuring 
the incorporation of pPJddAMP as a function of DNA polymerase concentration), and comparing the rate of 
incorporation to that of the mutant enzyme. If the mutation is reducing the discrimination against ddATP, 
then the mutant enzyme should have a higher specific activity for incorporation of pPjddAMP. 

As with the other examples. DNA polymerases with a 3' to 5' exonuclease activity can interfere with this 
assay, making an enzyme more discriminatory against ddNTPs than that due to discrimination at the level 
of incorporation of the analog. And as in Example 9. enzymes with high levels of exonuclease activity can 
deplete all the dNTPs, resulting in no net incorporation of [^PJddAMP. In these cases the concentration of 
DNA polymerase and the incubation time of the reaction should be adjusted to obtain the maximum level of 
incorporation of pPJddAMP by the DNA polymerase being tested. 

All of the above methods are based on radioactivity to detect either the length of the extended primer 
or the amount of DNA synthesis on the primer. The efficiency of incorporation of dideoxynucleotides by a 
DNA polymerase can also be measured nonradioactive^. Two examples are presented below that make 
use of either fluorescent primers or fluorescent dye-dideoxy terminators that are detected on an Applied 
Biosystems Model 373A DNA Sequencing System. 

Example 11. Determinati on of the efficiency of incorporation of dideoxynucleotides using a fluoresc ent 
primer annealed to single-stranded DNA and gel electrophoresis 

In this example a fluorescently labeled primer is annealed to single-stranded DNA and DNA synthesis 
reactions are carried out using varying ratios of dNTPs to ddNTPs. The samples are then loaded onto an 
Applied Biosystems Model 373A DNA Sequencing Syste- *nd the length of each fluorescent fragment is 
determined by direct fluorescent detection during gel electrophoresis. Reactions are carried out as 
described in Tabor and Richardson 265 J. Biol. Chem. P322. 1990. The fluorescent primer used is Tarn" 
primer (Applied Biosystems). The DNA used is single-stranded M13 mGPi-2 DNA as described in Example 

The primer is annealed to the Ml 3 mGP1 -2. single-stranded DNA template in a reaction mixture (1X = 
10 ul) containing 2 ug of M13 mGP1-2 DNA, 5 ng of primer. 40 mM Tris*HCI. pH 8.0. 10 mM MgCI 2 and 
100 mM NaCI. The mixture is incubated 65 'C for 2 min and then cooled to room temperature over 30 min. 
The standard reaction mixture (18 ul) contains 22 mM Tris-HCI, pH 8.0, 5.5 mM MgCI 2 . 55 mM NaCI, and 
varying concentrations of the 4dNTPs and one of the four ddNTPs. The reaction mixtures also contained 10 
ng of yeast inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the 
apparent discrimination by the DNA polymerase (Tabor and Richardson 265 J. Biol. Chem. 8322. 1990). 
The mixtures are incubated at 37 *C for 1 min (70 'C for thermophilic DNA polymerases), and the reactions 
are initiated by the addition of 2 ul aliquots (0.01 to 1 unit) of the DNA polymerase being analyzed diluted 
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in 20 mM Tris«HCt, pH 7.5, 10 mM 2-mercaptoethanol and 0.05% bovine serum albumin. The reaction 
mixtures are further incubated at 37 *C for 10 min (70 *C for thermophilic DNA polymeras s). The reactions 
are by the addition of 8 ul of 20 mM DTPA. 1 M potassium acetate, pH 5.0, and 60 ul of ethanol. After 
centrifugation, the DNA is resuspended in 6 ul of 80% form amide. 10 mM Tris*HCI, pH 8.0, and 1 mM 

5 EDTA, and heated at 80 *C for 2 min immediately prior to loading on the polyacrylamide gel on th 
Applied Biosystems 373A ON A Sequencing System following the manufacturer's procedures. 

Specific DNA polymerases being tested may have optimum buffer, pH, salt or temperature conditions 
that differ from those suggested above. Each DNA polymerase should be tested under the conditions that 
give optimum specific polymerase activity for that enzyme. The concentration of DNA polymerase should 

io be sufficient to extend most of the primers at least several hundred nucleotides or until a dideoxynucleotide 
has been incorporated in the 10 min reaction. 

The ratio of dNTPs to ddNTPs is adjusted to obtain optimum peak intensities for approximately three 
hundred bases. For example, for Taq DNA polymerase approximately 10 uM 4dNTPs and 200600 uM 
ddNTPs is optimal, while for Taq DNA polymerase F667Y approximately 300 uM 4dNTPs and 0.5-5 uM 

is ddNTPs is optimal. 

To determine whether a modification of a given DNA polymerase results in a decrease in its ability to 
discriminate against dideoxynucleotides, reactions should be carried out at varying ratios of dNTPs to 
ddNTPs for both the unmodified and modified DNA polymerases, and the intensities of dideoxy-terminated 
fragments of different lengths are compared to determine whether the modified DNA polymerase is using 
20 ddNTPs more efficiently than the unmodified enzyme. 

Example 12. Determination of the efficiency of incorporation of fluorescent dideoxynucleotides by gel 
electrophoresis 

25 In this example a nonfluorescent primer is annealed to single-stranded DNA and DNA synthesis 
reactions are carried out using varying ratios of dNTPs to a single fluorescently labeled-ddNTP. The 
samples are then loaded onto an Applied Biosystems Model 373A DNA Sequencing System, and the length 
of each fluorescent fragment is determined by direct fluorescent detection during gel electrophoresis. The 
primer used in this example is the 40-mer described in Example 3, and the template is the single-stranded 

30 M13 mGP1-2 described in Example 3. The primer is annealed to the M13 mGP1-2 single-stranded DNA 
template in a reaction mixture (1X = 10 ul) containing 2 ug of M13 mGP1-2 DNA, 6 pmoles of primer (a 
10-fold molar excess to template), 40 mM Tris«HCI, pH 8.0, 10 mM MgCb and 100 mM NaCI. The mixture 
is incubated 65 *C for 2 min and then cooled to room temperature over 30 min. The standard reaction 
mixture (18 ul) contains 22 mM Tris«HCI, pH 8.0, 5.5 mM MgCb. 55 mM NaCI, and varying concentrations 

35 of the 4dNTPs plus one of the four ftuorescently-labeled ddNTPs. The four fluorescently labeled ddNTPs 
are from Applied Biosystems (Taq DyeDeoxy Terminator Cycle Qequencing Kit, part number 401150), and 
are referred to as G, A, T or C "DyeDeoxy Terminators" (manual for Taq DyeDeoxy Terminator Cycle 
Sequencing Kit, part number 901497, Rev. E). The reaction mixtures also contained 10 ng of yeast 
inorganic pyrophosphatase to inhibit pyrophosphorolysis that could otherwise increase the apparent dis- 

40 crimination by the DNA polymerase (Tabor and Richardson 265 J. BioL Chem. 8322. 1990). The mixtures 
are incubated at 37 *C for 1 min (70 *C for thermophilic DNA polymerases), and the reactions are initiated 
by the addition of 2 ul aliquots (0.01 to 1 unit) of the DNA polymerase being analyzed diluted in 20 mM 
Tris*HCl, pH 7.5, 10 mM 2-mercaptoethanol and 0.05% bovine serum albumin. The reaction mixtures are 
further incubated at 37 *C for 10 min (70 # C for thermophilic DNA polymerases). The reactions are by the 

45 addition of 8 ul of 20 mM EDTA 1 M potassium acetate, pH 5.0, and 60 ul of ethanol. After centrifugation, 
the DNA is resuspended in 6 ul of 80% formamide, 10 mM Tris«HCI. pH 8.0, and 1 mM DTPA, and heated 
at 80 * C for 2 min immediately prior to loading on the polyacrylamide gel on the Applied Biosystems 373A 
DNA Sequencing System following the manufacturer's procedures . 

Specific DNA polymerases being tested may have optimum buffer, pH, salt, or temperature conditions 

so that differ from those suggested above. Each DNA polymerase should be tested under the conditions that 
give optimum specific polymerase activity for that enzyme. The concentration of DNA polymerase used in 
these reactions should be that concentration that is sufficient to extend most of the primers at least several 
hundred nucleotides or until a dideoxynucleotide has been incorporated in the 10 min reaction. For DNA 
polymerases that have a 5' to 3' exonuclease activity, such as Taq DNA polymerase, the DNA polymerase 

55 concentration must be kept low enough to avoid this activity degrading a significant percentage of the 5' 
ends of the fragments. 

To determine whether a DNA polymerase discriminates strongly or weakly against a fluorescent ddNTP, 
reactions are carried out using 20 uM 4dNTPs and 0.01 ul of each DyeDeoxy Terminator provided by 
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Applied Biosyst ms (part number 401150). When Taq ONA polymerase is used under these conditions, 
most of the fluorescence is eith r in unincorporated dye-ddNTPs at the leading front of the gel. or in 
fragments greater than several hundred bases in length. In contrast, with the Taq ONA polymerase mutant 
F667Y, under these conditions most of the fluorescence is in fragments that are less than several hundred 
bases in length, and a significantly lower percentage of the total fluorescence is in unincorporated dye- 
ddNTPs at the leading front of the gel. 

To determine whether a modification of a given DNA polymerase results in a decrease in its ability to 
discriminate against dideoxynucleotides, reactions are carried out at varying ratios of dNTPs to DyeOeoxy 
Terminators for both the unmodified and modified ONA polymerases, and the average length of the 
resulting fluorescent fragments are compared to determine whether the modified ONA polymerase is using 
the DyeOeoxy Terminators more efficiently than the unmodified enzyme. 

The following examples provide tests for determining the uniformity of band intensities produced from 
dideoxy-terminated fragments synthesized by different DNA polymerases. 

Example 13. Determination of uniformity of incorporation of dideoxynucleotides using a single-stranded M13 
DNA - 5' ^P-labeled 40-mer primer complex and gel electrophoresis 

In this example the uniformity of dideoxynucleotide incorporation is measured on a 5* ^P-end labeled 
primer extended on a single-stranded M13 ONA template. Three activities can cause variability in band 
intensity of dideoxy-terminated fragments. One is exonuclease activity that can preferentially at some 
sequences; this is avoided by removal of the activity selectively either by chemical or genetic means (see 
for example Tabor and Richardson 264. J. Biol. Chem. 6447, 1989). The second is pyrophosphorolysis; this 
is readily avoided by including pyrophosphatase in the reaction mixture, which degrades the pyrophosphate 
that accumulates during DNA synthesis and is a necessary substrate for pyrophosphorolysis. The third is 
sequence-specific variability in the incorporation of dideoxynucleotides. Variability in band intensity is 
detrimental to ONA sequence analysis, decreasing the accuracy of the ONA sequence determined. This test 
is designed to compare the degree of. variability in band intensities in fragments synthesized by different 
ONA polymerases, including mutant DNA polymerases that may incorporate dideoxynucleotides more 
efficiently. 

The primer, template and reaction conditions in this Example are identical to that described in 
Examples 6 and 7. The template is M13 mGP1-2 single-stranded ONA described in Example 3, and the 
primer is the 40 mer also described in Example 3. Reaction conditions used are those that are optimum for 
the DNA polymerase being tested with regard to buffer, pH, salts, and temperature of the reaction. It is 
preferred that magnesium is the only metal ion present in the reaction mixture (i.e. the reactions are carried 
out in the absence of added manganese ions). A concentration of DNA polymerase is chosen whereby most 
of the primers are extended in a 10 min reaction and are terminated by the incorporation of a dideox- 
ynucleotide. The ratios of dNTPs to ddNTPs are adjusted for the specific ONA polymerase being tested so 
that the average fragment size is approximately 100-300 nucleotides. ddCTP is the preferred ddNTP to use 
for the test of uniformity since fragments terminated with this dideoxynucleotide tend to have the most 
variability in intensities (see for example Tabor and Richardson 86 Rroc. Natl. Acad. Set. 4076. 1989). Gel 
electrophoresis, autoradiography, and analysis of the band intensities by either scanning of the gel or 
phosphoimager analysis are as described in Example 6. Electrophoresis is carried out until fragments of 
approximately 55 nucleotides in length are at the bottom of the gel (the dye bromphenol blue has run off 
the bottom of the gel and the dye xylene cyanol is approximately 8 cm from the bottom of the gel). 

For a given series of ddNMP-terminated fragments, for example a series of ddCMP-terminated 
fragments, the intensities of the first 20 fragments from the bottom of the gel are determined, preferably by 
Phosphoimager analysis. Alternatively, the autoradiograph can be scanned by an imaging densitometer to 
determine the relative intensities of the first 20 fragments. These intensities are then analyzed statistically 
as described in Example 6 in order to determine their variability. For example, the values can be plotted 
using the Macintosh program Kaleidograph Version 3.0 (Synergy Software). The resulting plots are fit to an 
exponential decay curve using the Kaleidograph library routine for this function. R 2 , the correlation index for 
the data, is calculated by the Kaleidograph library routine. This is a measure of the variability in band 
intensities. The values obtained for R 2 using a new DNA polymerase are compared to those obtained using 
known ONA polymerases, for example _28 T7 DNA polymerase (Sequenase Version 2.0. United States 
Biochemical Corporation) in the presence of magnesium or manganese (see Tabor and Richardson 265 J. 
Biol. Chem. 8322. 1990), E. coli DNA polymerase I (either Klenow fragment or Klenow fragment with the 
mutation F762Y) or Taq DNA polymerase (either wild-type or the mutant F667Y). The R 2 values obtained 
with these known DNA polymerases are used as standards by which to compare a new DNA polymerase 
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